<86>Sep 16 01:23:15 userdel[627520]: delete user 'rooter' <86>Sep 16 01:23:15 userdel[627520]: removed group 'rooter' owned by 'rooter' <86>Sep 16 01:23:15 userdel[627520]: removed shadow group 'rooter' owned by 'rooter' <86>Sep 16 01:23:15 groupadd[627535]: group added to /etc/group: name=rooter, GID=1796 <86>Sep 16 01:23:15 groupadd[627535]: group added to /etc/gshadow: name=rooter <86>Sep 16 01:23:15 groupadd[627535]: new group: name=rooter, GID=1796 <86>Sep 16 01:23:15 useradd[627552]: new user: name=rooter, UID=1796, GID=1796, home=/root, shell=/bin/bash, from=none <86>Sep 16 01:23:15 userdel[627577]: delete user 'builder' <86>Sep 16 01:23:15 userdel[627577]: removed group 'builder' owned by 'builder' <86>Sep 16 01:23:15 userdel[627577]: removed shadow group 'builder' owned by 'builder' <86>Sep 16 01:23:15 groupadd[627598]: group added to /etc/group: name=builder, GID=1797 <86>Sep 16 01:23:15 groupadd[627598]: group added to /etc/gshadow: name=builder <86>Sep 16 01:23:15 groupadd[627598]: new group: name=builder, GID=1797 <86>Sep 16 01:23:15 useradd[627615]: new user: name=builder, UID=1797, GID=1797, home=/usr/src, shell=/bin/bash, from=none /usr/src/in/srpm/rccl-2.18.6-alt0.1.src.rpm: bad symbols in the license tag: // <13>Sep 16 01:23:19 rpmi: libidn2-2.3.7-alt1 sisyphus+339505.100.1.2 1706718968 installed <13>Sep 16 01:23:19 rpmi: libnettle8-3.9.1-alt1 sisyphus+322548.100.1.2 1686176879 installed <13>Sep 16 01:23:19 rpmi: libp11-kit-1:0.25.5-alt1 sisyphus+352553.100.1.1 1720622573 installed <13>Sep 16 01:23:19 rpmi: libtasn1-4.19.0-alt3 sisyphus+327816.100.1.1 1692802615 installed <13>Sep 16 01:23:19 rpmi: libhogweed6-3.9.1-alt1 sisyphus+322548.100.1.2 1686176879 installed <13>Sep 16 01:23:19 rpmi: libgnutls30-3.8.4-alt1 sisyphus+343729.100.2.1 1711571288 installed <13>Sep 16 01:23:19 rpmi: libngtcp2.16-1.7.0-alt1 sisyphus+356415.200.1.1 1725031912 installed <13>Sep 16 01:23:19 rpmi: libngtcp2_crypto_gnutls8-1.7.0-alt1 sisyphus+356415.200.1.1 1725031912 
installed <13>Sep 16 01:23:19 rpmi: cmake-modules-3.29.3-alt1 sisyphus+348648.100.2.1 1716590540 installed <13>Sep 16 01:23:19 rpmi: libuv-1.48.0-alt2 sisyphus+357579.100.1.1 1726426171 installed <13>Sep 16 01:23:19 rpmi: librhash-1.3.5-alt3 sisyphus+286141.40.2.1 1632982456 installed <13>Sep 16 01:23:19 rpmi: libjsoncpp24-1.9.4-alt2 sisyphus+346331.200.2.1 1716448551 installed <13>Sep 16 01:23:19 rpmi: libexpat-2.5.0-alt1 sisyphus+346180.200.2.1 1716349835 installed <13>Sep 16 01:23:19 rpmi: publicsuffix-list-dafsa-20240911-alt1 sisyphus+357399.100.1.1 1726160479 installed <13>Sep 16 01:23:19 rpmi: libpsl-0.21.5-alt1 sisyphus+338474.100.1.1 1705684769 installed <13>Sep 16 01:23:19 rpmi: libnghttp3.9-1.5.0-alt1 sisyphus+356415.100.1.1 1725031855 installed <13>Sep 16 01:23:19 rpmi: libnghttp2-1.63.0-alt1 sisyphus+356414.100.1.1 1725031508 installed <13>Sep 16 01:23:19 rpmi: openldap-common-2.6.8-alt1 sisyphus+351621.100.1.1 1719420449 installed <13>Sep 16 01:23:19 rpmi: libntlm-1.5-alt1 sisyphus+278100.3300.1.1 1626058899 installed <13>Sep 16 01:23:19 rpmi: libidn-1.37-alt2 sisyphus+300849.100.1.1 1653769687 installed <13>Sep 16 01:23:19 rpmi: libverto-0.3.2-alt1_1 sisyphus+321176.2200.10.2 1684803947 installed <13>Sep 16 01:23:19 rpmi: liblmdb-0.9.32-alt1 sisyphus+342426.100.1.1 1710124288 installed <13>Sep 16 01:23:19 rpmi: libkeyutils-1.6.3-alt1 sisyphus+346336.200.2.2 1716472658 installed <13>Sep 16 01:23:19 rpmi: libcom_err-1.46.4.0.5.4cda-alt1 sisyphus+283826.100.1.1 1629975345 installed <13>Sep 16 01:23:19 rpmi: libbrotlicommon-1.1.0-alt1 sisyphus+328501.100.1.1 1693598419 installed <13>Sep 16 01:23:19 rpmi: libbrotlidec-1.1.0-alt1 sisyphus+328501.100.1.1 1693598419 installed <13>Sep 16 01:23:19 rpmi: rpm-macros-cmake-3.29.1-alt1 sisyphus+344518.300.3.1 1712379787 installed <13>Sep 16 01:23:19 rpmi: rpm-macros-alternatives-0.5.2-alt2 sisyphus+315270.200.2.1 1676457367 installed <13>Sep 16 01:23:19 rpmi: alternatives-0.5.2-alt2 sisyphus+315270.200.2.1 
1676457367 installed <13>Sep 16 01:23:19 rpmi: ca-certificates-2024.07.01-alt1 sisyphus+351897.100.1.1 1719826350 installed <13>Sep 16 01:23:19 rpmi: ca-trust-0.2.0-alt1 sisyphus+344843.100.1.1 1712743326 installed <13>Sep 16 01:23:19 rpmi: p11-kit-trust-1:0.25.5-alt1 sisyphus+352553.100.1.1 1720622573 installed <13>Sep 16 01:23:19 rpmi: libcrypto3-3.1.7-alt1 sisyphus+356755.100.1.1 1725388416 installed <13>Sep 16 01:23:19 rpmi: libssl3-3.1.7-alt1 sisyphus+356755.100.1.1 1725388416 installed <86>Sep 16 01:23:19 groupadd[636473]: group added to /etc/group: name=_keytab, GID=999 <86>Sep 16 01:23:19 groupadd[636473]: group added to /etc/gshadow: name=_keytab <86>Sep 16 01:23:19 groupadd[636473]: new group: name=_keytab, GID=999 <13>Sep 16 01:23:19 rpmi: libkrb5-1.21.3-alt2 sisyphus+351857.100.1.1 1719735141 installed <13>Sep 16 01:23:19 rpmi: libgsasl-2.2.0-alt1 sisyphus+333173.100.1.1 1698696954 installed <86>Sep 16 01:23:19 groupadd[636583]: group added to /etc/group: name=sasl, GID=998 <86>Sep 16 01:23:19 groupadd[636583]: group added to /etc/gshadow: name=sasl <86>Sep 16 01:23:19 groupadd[636583]: new group: name=sasl, GID=998 <13>Sep 16 01:23:19 rpmi: libsasl2-3-2.1.28-alt2 sisyphus+343335.100.1.1 1711112544 installed <13>Sep 16 01:23:19 rpmi: libldap2-2.6.8-alt1 sisyphus+351621.100.1.1 1719420449 installed <13>Sep 16 01:23:19 rpmi: libarchive13-3.6.1-alt2 sisyphus+324359.1300.6.1 1689326379 installed <13>Sep 16 01:23:19 rpmi: libssh2-1.11.0-alt2 sisyphus+339356.100.1.1 1706593137 installed <13>Sep 16 01:23:19 rpmi: libcurl-8.10.0-alt1 sisyphus+357271.100.1.1 1726044759 installed <13>Sep 16 01:23:20 rpmi: cmake-3.29.3-alt1 sisyphus+348648.100.2.1 1716590540 installed <13>Sep 16 01:23:28 rpmi: llvm-common-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Sep 16 01:23:28 rpmi: llvm-rocm-filesystem-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:23:28 rpmi: libnuma-2.0.14-alt2 sisyphus+278485.100.1.1 1626104244 installed <13>Sep 16 
01:23:28 rpmi: rocm-device-libs-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:23:28 rpmi: llvm18.1-filesystem-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:23:29 rpmi: clang18.1-support-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:23:29 rpmi: llvm18.1-polly-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:23:29 rpmi: gcc-c++-common-1.4.28-alt1 sisyphus+348678.100.1.1 1716396142 installed <13>Sep 16 01:23:29 rpmi: libstdc++13-devel-13.2.1-alt4 sisyphus+354645.100.1.1 1723060849 installed <13>Sep 16 01:23:29 rpmi: librocm-smi1-6.1.2-alt0.2 sisyphus+352428.100.1.1 1720459745 installed <13>Sep 16 01:23:29 rpmi: libpciaccess-1:0.18.1-alt1 sisyphus+343583.300.1.1 1711440789 installed <13>Sep 16 01:23:29 rpmi: libdrm-1:2.4.123-alt1 sisyphus+357330.40.3.1 1726125397 installed <13>Sep 16 01:23:29 rpmi: libhsakmt1-6.1.2-alt0.1 sisyphus+352247.600.5.1 1720254766 installed <13>Sep 16 01:23:29 rpmi: libhsa-runtime1-6.1.2-alt0.1 sisyphus+352247.1600.9.1 1720269840 installed <13>Sep 16 01:23:29 rpmi: libpci-3.13.0-alt1 sisyphus+350694.100.1.1 1717993339 installed <13>Sep 16 01:23:29 rpmi: pciids-20240913-alt1 sisyphus+357455.100.1.1 1726250568 installed <13>Sep 16 01:23:29 rpmi: pciutils-3.13.0-alt1 sisyphus+350694.100.1.1 1717993339 installed <13>Sep 16 01:23:29 rpmi: libmpdec3-2.5.1-alt3 sisyphus+314490.500.5.1 1675432004 installed <13>Sep 16 01:23:29 rpmi: libgdbm-1.8.3-alt10 sisyphus+346222.200.3.2 1716468404 installed <13>Sep 16 01:23:29 rpmi: libb2-0.98.1-alt1_1 sisyphus+291614.100.1.1 1638962877 installed <13>Sep 16 01:23:29 rpmi: python3-3.12.6-alt1 sisyphus+357228.100.1.1 1725970095 installed <13>Sep 16 01:23:30 rpmi: python3-base-3.12.6-alt1 sisyphus+357228.100.1.1 1725970095 installed <13>Sep 16 01:23:30 rpmi: clang-rocm-libs-support-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:23:33 rpmi: clang-rocm-libs-6.1.2-alt0.2 
sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:23:33 rpmi: rocminfo-6.1.2-alt0.1 sisyphus+352247.1700.9.1 1720269882 installed <13>Sep 16 01:23:33 rpmi: libedit3-3.1.20230828-alt1 sisyphus+330914.200.3.1 1696922743 installed <13>Sep 16 01:23:33 rpmi: llvm18.1-gold-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:23:34 rpmi: llvm18.1-libs-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:23:35 rpmi: libclang-cpp18-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:23:35 rpmi: clang18.1-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:23:35 rpmi: clang-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Sep 16 01:23:36 rpmi: clang-rocm-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:23:38 rpmi: llvm18.1-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:23:38 rpmi: llvm-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Sep 16 01:23:50 rpmi: llvm-rocm-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:23:50 rpmi: libclang18-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:23:50 rpmi: clang18.1-devel-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:23:50 rpmi: clang-devel-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Sep 16 01:23:51 rpmi: clang18.1-tools-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:23:51 rpmi: clang-tools-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Sep 16 01:23:57 rpmi: clang-rocm-tools-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:23:57 rpmi: lld18.1-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:23:57 rpmi: lld-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Sep 16 01:23:58 rpmi: lld-rocm-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:24:00 rpmi: 
libamd_comgr2-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:24:01 rpmi: llvm-rocm-gold-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:24:01 rpmi: llvm-rocm-libs-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:24:02 rpmi: hip-runtime-amd-6.1.2-alt0.2 sisyphus+352364.100.1.1 1720383820 installed <13>Sep 16 01:24:02 rpmi: hipcc-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:24:04 rpmi: mlir18.1-tools-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:24:21 rpmi: llvm18.1-devel-18.1.8-alt0.1 sisyphus+351511.100.1.1 1719305993 installed <13>Sep 16 01:24:21 rpmi: llvm-devel-18.1.0-alt1 sisyphus+343963.100.6.1 1713343158 installed <13>Sep 16 01:24:33 rpmi: llvm-rocm-devel-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:24:33 rpmi: hip-devel-6.1.2-alt0.2 sisyphus+352364.100.1.1 1720383820 installed <13>Sep 16 01:24:33 rpmi: rocm-comgr-devel-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:24:42 rpmi: clang-rocm-devel-6.1.2-alt0.2 sisyphus+352528.100.1.1 1720604563 installed <13>Sep 16 01:24:43 rpmi: hipify-clang-6.1.2-alt0.1 sisyphus+352428.200.1.1 1720459887 installed <13>Sep 16 01:24:43 rpmi: hsa-rocr-devel-6.1.2-alt0.1 sisyphus+352247.1600.9.1 1720269840 installed <13>Sep 16 01:24:43 rpmi: librocm-smi-devel-6.1.2-alt0.2 sisyphus+352428.100.1.1 1720459745 installed <13>Sep 16 01:24:43 rpmi: libstdc++-devel-13-alt1 sisyphus+323337.300.1.1 1687267966 installed <13>Sep 16 01:24:43 rpmi: rocm-cmake-6.1.2-alt0.1 sisyphus+352247.100.1.1 1720180839 installed Building target platforms: x86_64 Building for target x86_64 Wrote: /usr/src/in/nosrpm/rccl-2.18.6-alt0.1.nosrc.rpm (w1.gzdio) Installing rccl-2.18.6-alt0.1.src.rpm Building target platforms: x86_64 Building for target x86_64 Executing(%prep): /bin/sh -e /usr/src/tmp/rpm-tmp.69590 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd 
/usr/src/RPM/BUILD
+ rm -rf rccl-2.18.6
+ echo 'Source #0 (rccl-2.18.6.tar):'
Source #0 (rccl-2.18.6.tar):
+ /bin/tar -xf /usr/src/RPM/SOURCES/rccl-2.18.6.tar
+ cd rccl-2.18.6
+ /bin/chmod -c -Rf u+rwX,go-w .
+ subst 's,cat ${ROCM_PATH}/.info/version,echo 6.1.2,' CMakeLists.txt
+ exit 0
Executing(%build): /bin/sh -e /usr/src/tmp/rpm-tmp.69590
+ umask 022
+ /bin/mkdir -p /usr/src/RPM/BUILD
+ cd /usr/src/RPM/BUILD
+ cd rccl-2.18.6
+ export ALTWRAP_LLVM_VERSION=rocm
+ ALTWRAP_LLVM_VERSION=rocm
+ mkdir -p x86_64-alt-linux
+ cmake -DCMAKE_SKIP_INSTALL_RPATH:BOOL=yes '-DCMAKE_C_FLAGS:STRING=-pipe -frecord-gcc-switches -Wall -g -O2 ' '-DCMAKE_CXX_FLAGS:STRING=-pipe -frecord-gcc-switches -Wall -g -O2 ' '-DCMAKE_Fortran_FLAGS:STRING=-pipe -frecord-gcc-switches -Wall -g -O2 ' -DCMAKE_INSTALL_PREFIX=/usr -DINCLUDE_INSTALL_DIR:PATH=/usr/include -DLIB_INSTALL_DIR:PATH=/usr/lib64 -DSYSCONF_INSTALL_DIR:PATH=/etc -DSHARE_INSTALL_PREFIX:PATH=/usr/share -DLIB_DESTINATION=lib64 -DLIB_SUFFIX=64 -S . -B x86_64-alt-linux -Wno-dev -DROCM_PATH=/usr -DCMAKE_C_COMPILER=clang -DCMAKE_CXX_COMPILER=clang++ -DCMAKE_INSTALL_LIBDIR=lib64 -DENABLE_MSCCL_KERNEL=ON
-- The CXX compiler identification is Clang 17.0.0
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Check for working CXX compiler: /usr/bin/clang++ - skipped
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11")
-- Checking for ROCm support for GPU targets:
-- Performing Test COMPILER_HAS_TARGET_ID_gfx803
-- Performing Test COMPILER_HAS_TARGET_ID_gfx803 - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx900_xnack_off
-- Performing Test COMPILER_HAS_TARGET_ID_gfx900_xnack_off - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx906_xnack_off
-- Performing Test COMPILER_HAS_TARGET_ID_gfx906_xnack_off - Success
-- Performing Test
COMPILER_HAS_TARGET_ID_gfx908_xnack_off
-- Performing Test COMPILER_HAS_TARGET_ID_gfx908_xnack_off - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_off
-- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_off - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_on
-- Performing Test COMPILER_HAS_TARGET_ID_gfx90a_xnack_on - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx940
-- Performing Test COMPILER_HAS_TARGET_ID_gfx940 - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx941
-- Performing Test COMPILER_HAS_TARGET_ID_gfx941 - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx942
-- Performing Test COMPILER_HAS_TARGET_ID_gfx942 - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx1030
-- Performing Test COMPILER_HAS_TARGET_ID_gfx1030 - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx1100
-- Performing Test COMPILER_HAS_TARGET_ID_gfx1100 - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx1101
-- Performing Test COMPILER_HAS_TARGET_ID_gfx1101 - Success
-- Performing Test COMPILER_HAS_TARGET_ID_gfx1102
-- Performing Test COMPILER_HAS_TARGET_ID_gfx1102 - Success
-- Compiling for gfx803;gfx900:xnack-;gfx906:xnack-;gfx908:xnack-;gfx90a:xnack-;gfx90a:xnack+;gfx940;gfx941;gfx942;gfx1030;gfx1100;gfx1101;gfx1102
-- Could NOT find GTest (missing: GTEST_LIBRARY GTEST_INCLUDE_DIR GTEST_MAIN_LIBRARY) (Required is at least version "1.11")
-- ROCM_PATH found: /usr
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD
-- Performing Test CMAKE_HAVE_LIBC_PTHREAD - Success
-- Found Threads: TRUE
-- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS
-- Performing Test HIP_CLANG_SUPPORTS_PARALLEL_JOBS - Success
-- HIP compiler: clang
-- HIP runtime: rocclr
-- hipcc executable: /usr/bin/hipcc
-- hipcc version: 6.1.40093
-- ROCm version: 6.1.2
*******************************************************************************
*------------------------------- ROCMChecks WARNING --------------------------*
Options and properties should be set on a cmake
target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='.
ROCMChecks now calling:
CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message):
'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below:
Call Stack (most recent call first):
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var)
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:79 (string)
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS)
  CMakeLists.txt:145 (check_symbol_exists)
*-----------------------------------------------------------------------------*
*******************************************************************************
*******************************************************************************
*------------------------------- ROCMChecks WARNING --------------------------*
Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='.
ROCMChecks now calling:
CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message):
'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below:
Call Stack (most recent call first):
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var)
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:80 (string)
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS)
  CMakeLists.txt:145 (check_symbol_exists)
*-----------------------------------------------------------------------------*
*******************************************************************************
-- Looking for hipEventDisableSystemFence
-- Looking for hipEventDisableSystemFence - not found
*******************************************************************************
*------------------------------- ROCMChecks WARNING --------------------------*
Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='.
ROCMChecks now calling:
CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message):
'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below:
Call Stack (most recent call first):
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var)
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:84 (set)
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_RESTORE_FLAGS)
  CMakeLists.txt:145 (check_symbol_exists)
*-----------------------------------------------------------------------------*
*******************************************************************************
*******************************************************************************
*------------------------------- ROCMChecks WARNING --------------------------*
Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='.
ROCMChecks now calling:
CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message):
'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below:
Call Stack (most recent call first):
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var)
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:79 (string)
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS)
  CMakeLists.txt:148 (check_symbol_exists)
*-----------------------------------------------------------------------------*
*******************************************************************************
*******************************************************************************
*------------------------------- ROCMChecks WARNING --------------------------*
Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='.
ROCMChecks now calling:
CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message):
'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below:
Call Stack (most recent call first):
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var)
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:80 (string)
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:69 (__CHECK_SYMBOL_EXISTS_FILTER_FLAGS)
  CMakeLists.txt:148 (check_symbol_exists)
*-----------------------------------------------------------------------------*
*******************************************************************************
-- Looking for hipDeviceMallocUncached
-- Looking for hipDeviceMallocUncached - not found
*******************************************************************************
*------------------------------- ROCMChecks WARNING --------------------------*
Options and properties should be set on a cmake target where possible. The variable 'CMAKE_CXX_FLAGS' may be set by the cmake toolchain, either by calling 'cmake -DCMAKE_CXX_FLAGS="-pipe -frecord-gcc-switches -Wall -g -O2"' or set in a toolchain file and added with 'cmake -DCMAKE_TOOLCHAIN_FILE='.
ROCMChecks now calling:
CMake Warning at /usr/share/rocmcmakebuildtools/cmake/ROCMChecks.cmake:46 (message):
'CMAKE_CXX_FLAGS' is set at /usr/src/RPM/BUILD/rccl-2.18.6/CMakeLists.txt: shown below:
Call Stack (most recent call first):
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:9223372036854775807 (rocm_check_toolchain_var)
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:84 (set)
  /usr/share/cmake/Modules/CheckSymbolExists.cmake:71 (__CHECK_SYMBOL_EXISTS_RESTORE_FLAGS)
  CMakeLists.txt:148 (check_symbol_exists)
*-----------------------------------------------------------------------------*
*******************************************************************************
-- HSA runtime: /usr/include
-- Found rocm_smi at /usr/include
-- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h
-- Looking for C++ include /usr/include/rocm_smi/rocm_smi64Config.h - found
-- Performing Test HAVE_KERNARG_PRELOAD
-- Performing Test HAVE_KERNARG_PRELOAD - Success
-- Kernarg preloading to SGPR enabled
-- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp
-- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp
-- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp
-- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp
-- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp
-- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp
-- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp
-- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp
-- Generating
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp -- Generating 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/device_table.h -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/device_table.cpp -- Generating /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/host_table.cpp -- HIP_UNCACHED_MEMORY enabled -- RCCL LL128 protocol enabled -- Building shared RCCL library -- rocm-cmake: Set license file to /usr/src/RPM/BUILD/rccl-2.18.6/LICENSE.txt. 
-- Configuring done (14.6s) -- Generating done (0.0s) CMake Warning: Manually-specified variables were not used by the project: CMAKE_C_COMPILER CMAKE_C_FLAGS CMAKE_Fortran_FLAGS LIB_DESTINATION LIB_SUFFIX SHARE_INSTALL_PREFIX SYSCONF_INSTALL_DIR -- Build files have been written to: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux + cmake --build x86_64-alt-linux --verbose --parallel 16 Change Dir: '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' Run Build Command(s): /usr/bin/cmake -E env VERBOSE=1 /usr/bin/gmake -f Makefile -j16 gmake: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/cmake -S/usr/src/RPM/BUILD/rccl-2.18.6 -B/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux --check-build-system CMakeFiles/Makefile.cmake 0 gmake: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/cmake -E cmake_progress_start /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux//CMakeFiles/progress.marks gmake: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/Makefile2 all /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/depend gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' cd /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles/git_version_check.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/git_version_check.dir/build.make CMakeFiles/git_version_check.dir/build gmake[2]: Entering directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Updating git_version.cpp if necessary /usr/bin/cmake -P /usr/src/RPM/BUILD/rccl-2.18.6/cmake/git_version.cmake -- Updating git_version.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[1]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Built target git_version_check gmake[1]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/depend gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/collectives/all_gather.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_gather.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/collectives/all_reduce.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_reduce.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/collectives/all_to_all.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings 
/usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_to_all.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/collectives/all_to_allv.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/all_to_allv.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 2%] Hipifying src/collectives/broadcast.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/broadcast.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 1%] Hipifying src/channel.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/channel.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 3%] Hipifying src/collectives/device/alltoall_pivot.h -> 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/alltoall_pivot.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 2%] Hipifying src/collectives/device/broadcast.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/broadcast.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/onerank_reduce.cu -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/onerank_reduce.cu.cpp mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/onerank_reduce.cu -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/onerank_reduce.cu.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 2%] Hipifying src/collectives/device/all_gather.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && 
/usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/all_gather.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/bootstrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/bootstrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 0%] Hipifying src/transport/shm.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/shm.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/common_kernel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common_kernel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/common_kernel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common_kernel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 2%] Hipifying 
src/collectives/device/all_reduce.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/all_reduce.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/common.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/common.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/msccl_kernel_impl.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/msccl_kernel_impl.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 4%] Hipifying src/collectives/device/op128.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/op128.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && 
/usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/op128.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/op128.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/primitives.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/primitives.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/reduce.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/reduce.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/gather.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/gather.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering 
directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/device/reduce_scatter.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/reduce_scatter.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/reduce.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/reduce.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/reduce_scatter.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/reduce_scatter.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 6%] Hipifying src/collectives/device/sendrecv.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h mkdir -p 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/sendrecv.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/msccl.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/msccl.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/prims_ll128.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/prims_ll128.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/prims_ll.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/prims_ll.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/collectives/scatter.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/scatter.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/collectives/sendrecv.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/sendrecv.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/reduce_kernel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_kernel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/reduce_kernel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_kernel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 7%] Hipifying src/debug.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc mkdir -p 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/debug.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 5%] Hipifying src/collectives/device/prims_simple.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/collectives/device/prims_simple.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/graph/connect.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/connect.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/graph/rings.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rings.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 
10%] Hipifying src/graph/rings.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rings.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/rome_models.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rome_models.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 11%] Hipifying src/graph/trees.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/trees.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/trees.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/trees.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/topo.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/topo.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/include/BfdBacktrace.hpp -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/BfdBacktrace.hpp mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/BfdBacktrace.hpp -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/BfdBacktrace.hpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/graph/xml.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/xml.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 11%] Hipifying src/group.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/group.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 9%] Hipifying src/graph/paths.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/paths.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc gmake[2]: Leaving directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 8%] Hipifying src/enqueue.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/enqueue.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 11%] Hipifying src/graph/tuning.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/tuning.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/topo.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/topo.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 11%] Hipifying src/graph/xml.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/xml.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc gmake[2]: 
Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/search.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/search.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 10%] Hipifying src/graph/rome_models.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/graph/rome_models.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 12%] Hipifying src/include/align.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/align.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/align.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/align.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/alloc.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/alloc.h -o 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/archinfo.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/archinfo.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/archinfo.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/archinfo.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/argcheck.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/argcheck.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 13%] Hipifying src/include/bootstrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/bootstrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/bootstrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/bootstrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/channel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h mkdir -p 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/channel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/checks.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/checks.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/coll_net.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/coll_net.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/core.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/core.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/cpuset.h 
-> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/cpuset.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/cpuset.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/cpuset.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/debug.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/debug.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/debug.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/debug.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/enqueue.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/enqueue.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 15%] Hipifying src/include/comm.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/comm.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/devcomm.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/devcomm.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/devcomm.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/devcomm.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 16%] Hipifying src/include/gdrwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/gdrwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 14%] Hipifying src/include/collectives.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/collectives.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/collectives.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/collectives.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/git_version.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/git_version.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/git_version.h -o 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/git_version.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/group.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/group.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/graph.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/graph.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 17%] Hipifying src/include/ibvcore.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvcore.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ibvcore.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvcore.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/ibvsymbols.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvsymbols.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include 
&& /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ibvsymbols.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvsymbols.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/ibvwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ibvwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/info.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/info.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_lifecycle.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_lifecycle.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_lifecycle.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_lifecycle.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying 
src/include/ipcsocket.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ipcsocket.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/ipcsocket.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ipcsocket.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_parser.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_parser.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_scheduler.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_scheduler.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_scheduler.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_scheduler.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 19%] Hipifying src/include/msccl/msccl_status.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_status.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings 
/usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_status.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_status.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 18%] Hipifying src/include/msccl/msccl_kernel.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_kernel.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_kernel.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_kernel.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 20%] Hipifying src/include/msccl/msccl_setup.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_setup.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_setup.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_setup.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 20%] Hipifying src/include/msccl/msccl_struct.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_struct.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/msccl/msccl_struct.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_struct.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 20%] Hipifying src/include/nccl_net.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nccl_net.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nccl_net.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nccl_net.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/net.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/net.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/npkit/npkit.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/npkit/npkit.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/npkit/npkit_event.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_event.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/npkit/npkit_event.h -o 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_event.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 21%] Hipifying src/include/npkit/npkit_struct.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_struct.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/npkit/npkit_struct.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/npkit/npkit_struct.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvmlwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvmlwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvmlwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvmlwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx3/nvToolsExtCuda.h -> 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCuda.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtCuda.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCuda.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx3/nvToolsExtCudaRt.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCudaRt.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtCudaRt.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtCudaRt.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 23%] Hipifying src/include/nvtx3/nvToolsExtOpenCL.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtOpenCL.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtOpenCL.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtOpenCL.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 22%] Hipifying src/include/nvtx3/nvToolsExt.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExt.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings 
/usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExt.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExt.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 23%] Hipifying src/include/nvtx3/nvToolsExtSync.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtSync.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtSync.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtSync.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCudaRt_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 23%] Hipifying src/include/nvtx3/nvToolsExtPayload.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtPayload.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvToolsExtPayload.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvToolsExtPayload.h gmake[2]: Leaving 
directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCore.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplCore.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCore.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImpl.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImpl.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImpl.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplCuda_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplOpenCL_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxImplSync_v3.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInit.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxInit.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInit.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 25%] Hipifying 
src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDecls.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxLinkOnce.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 26%] Hipifying src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxInitDefs.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 24%] Hipifying src/include/nvtx3/nvtx3.hpp -> 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtx3.hpp mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3 && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtx3.hpp -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtx3.hpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImpl.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxDetail/nvtxTypes.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxDetail/nvtxTypes.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxDetail/nvtxTypes.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h mkdir -p 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtImplPayload_v1.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 28%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtPayloadTypeInfo.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 28%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtTypes.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 27%] Hipifying src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h mkdir -p 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx3/nvtxExtDetail/nvtxExtInit.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 28%] Hipifying src/include/nvtx_stub.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx_stub.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/nvtx_stub.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/nvtx_stub.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/p2p.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/p2p.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/p2p.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/p2p.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/param.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/param.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/param.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/param.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/profiler.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/profiler.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 29%] Hipifying src/include/proxy.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/proxy.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 30%] Hipifying src/include/rccl_vars.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_vars.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rccl_vars.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_vars.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 30%] Hipifying src/include/rccl_bfloat16.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_bfloat16.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rccl_bfloat16.h -o 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rccl_bfloat16.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 30%] Hipifying src/include/rocm_smi_wrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocm_smi_wrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rocm_smi_wrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocm_smi_wrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/rocmwrap.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocmwrap.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/rocmwrap.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/rocmwrap.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/shm.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/shm.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/shm.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/shm.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/signals.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/signals.h mkdir -p 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/signals.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/signals.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 31%] Hipifying src/include/socket.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/socket.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/socket.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/socket.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/strongstream.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/strongstream.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/strongstream.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/strongstream.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/timer.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/timer.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/timer.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/timer.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying 
src/include/trees.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/trees.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/trees.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/trees.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 32%] Hipifying src/include/transport.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/transport.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/misc/argcheck.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/argcheck.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/misc/archinfo.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/archinfo.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/archinfo.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/archinfo.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: 
Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/include/utils.h -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/include/utils.h -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 34%] Hipifying src/misc/ibvsymbols.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/ibvsymbols.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 34%] Hipifying src/misc/ibvwrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/ibvwrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 34%] Hipifying src/misc/ipcsocket.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/ipcsocket.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_lifecycle.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_lifecycle.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_status.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_status.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_status.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_status.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_setup.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_setup.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 35%] Hipifying src/misc/msccl/msccl_parser.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc mkdir 
-p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/msccl/msccl_parser.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/npkit.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/npkit.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/nvmlwrap_stub.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/nvmlwrap_stub.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/nvmlwrap_stub.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/nvmlwrap_stub.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 33%] Hipifying src/init.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/init.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/param.cc -> 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/param.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/param.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/param.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 36%] Hipifying src/misc/profiler.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/profiler.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/rocm_smi_wrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/rocm_smi_wrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/rocmwrap.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocmwrap.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/rocmwrap.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocmwrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/signals.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/signals.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/signals.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/signals.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 37%] Hipifying src/misc/shmutils.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/shmutils.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 38%] Hipifying src/misc/utils.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/utils.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 38%] Hipifying src/misc/socket.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/socket.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc gmake[2]: Leaving directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 38%] Hipifying src/misc/strongstream.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/strongstream.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/misc/strongstream.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/strongstream.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/transport.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/net.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/net.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/proxy.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/proxy.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc gmake[2]: Leaving directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 41%] Hipifying src/transport/p2p.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/p2p.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 39%] Hipifying src/transport/coll_net.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/coll_net.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/nvls.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/nvls.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/net_socket.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings 
/usr/src/RPM/BUILD/rccl-2.18.6/src/transport/net_socket.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/net.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/net.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 40%] Hipifying src/transport/net_ib.cc -> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc mkdir -p /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport && /usr/bin/hipify-perl -quiet-warnings /usr/src/RPM/BUILD/rccl-2.18.6/src/transport/net_ib.cc -o /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' cd /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux && /usr/bin/cmake -E cmake_depends "Unix Makefiles" /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles/rccl.dir/DependInfo.cmake "--color=" gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/gmake -f CMakeFiles/rccl.dir/build.make CMakeFiles/rccl.dir/build gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 45%] Building CXX object 
CMakeFiles/rccl.dir/hipify/src/debug.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/debug.cc.o -MF CMakeFiles/rccl.dir/hipify/src/debug.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/debug.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/debug.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused 
variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: 
warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ 3 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' 
[-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:18:38: warning: unused variable 'AllGatherSchema' [-Wunused-variable] 18 | constexpr nvtxPayloadSchemaEntry_t AllGatherSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:21:10: warning: unused variable 'msgsize' [-Wunused-variable] 21 | size_t msgsize = sendcount * ncclTypeSize(datatype); | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_gather.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 
'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function]
   40 | static long log2i(long n) {
      |             ^~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/bootstrap.cc:8:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function]
   40 | static long log2i(long n) {
      |             ^~~~~
1 warning generated when compiling for host.
1 warning generated when compiling for gfx942.
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. 
1 warning generated when compiling for gfx90a.
1 warning generated when compiling for gfx1030.
1 warning generated when compiling for gfx941.
1 warning generated when compiling for gfx900.
1 warning generated when compiling for gfx803.
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function]
   40 | static long log2i(long n) {
      |             ^~~~~
1 warning generated when compiling for host.
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/gather.cc:9:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function]
   40 | static long log2i(long n) {
      |             ^~~~~
1 warning generated when compiling for gfx942.
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 
29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ 3 warnings generated when compiling for gfx908. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ 3 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx906. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:23:38: warning: unused variable 'ReduceSchema' [-Wunused-variable] 23 | constexpr nvtxPayloadSchemaEntry_t ReduceSchema[] = { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:29:20: warning: unused variable 'payload' [-Wunused-variable] 29 | NvtxParamsReduce payload{count * ncclTypeSize(datatype), root, op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce.cc:8: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. 3 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true 
-mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t 
SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' 
[-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * 
ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings 
generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1100. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx908. 4 warnings generated when compiling for gfx1101. 4 warnings generated when compiling for gfx941. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx1030. 4 warnings generated when compiling for gfx803. 4 warnings generated when compiling for gfx906. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:27:22: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:50:22: warning: unused variable 'payload' [-Wunused-variable] 50 | NvtxParamsSendRecv payload{count * ncclTypeSize(datatype), peer}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/sendrecv.cc:18:42: warning: unused variable 'SendRecvSchema' [-Wunused-const-variable] 18 | constexpr const nvtxPayloadSchemaEntry_t SendRecvSchema[] = { | ^~~~~~~~~~~~~~ 4 warnings generated when compiling for host. 4 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 
'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ 3 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 
warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings 
generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:21:38: warning: unused variable 'BroadcastSchema' [-Wunused-variable] 21 | constexpr nvtxPayloadSchemaEntry_t BroadcastSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:25:23: warning: unused variable 'payload' [-Wunused-variable] 25 | NvtxParamsBroadcast payload{count * ncclTypeSize(datatype), root}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/broadcast.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: 
unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | 
NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; 
| ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 
warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 
warnings generated when compiling for gfx942. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:22:45: warning: unused variable 'AllReduceSchema' [-Wunused-variable] 22 | static constexpr nvtxPayloadSchemaEntry_t AllReduceSchema[] = { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:27:23: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsAllReduce payload{count * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_reduce.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 
1 warning generated when compiling for gfx803.
1 warning generated when compiling for gfx908.
1 warning generated when compiling for gfx1101.
1 warning generated when compiling for gfx1030.
1 warning generated when compiling for gfx940.
1 warning generated when compiling for gfx90a.
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function]
   40 | static long log2i(long n) {
      |             ^~~~~
1 warning generated when compiling for host.
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_allv.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/scatter.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 43%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | 
^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' 
[-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | 
^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 
'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, 
int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 
'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: 
warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx90a. 5 warnings generated when compiling for gfx941. 5 warnings generated when compiling for gfx90a. 5 warnings generated when compiling for gfx906. 5 warnings generated when compiling for gfx1030. 5 warnings generated when compiling for gfx908. 5 warnings generated when compiling for gfx1100. 5 warnings generated when compiling for gfx1102. 5 warnings generated when compiling for gfx1101. 5 warnings generated when compiling for gfx900. 5 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* 
system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/all_to_all.cc:11: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
[ 45%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o
/usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable]
   22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = {
      |                                    ^~~~~~~~~~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx941. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx803. 
3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx900. 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for host. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:22:38: warning: unused variable 'ReduceScatterSchema' [-Wunused-variable] 22 | constexpr nvtxPayloadSchemaEntry_t ReduceScatterSchema[] = { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:27:27: warning: unused variable 'payload' [-Wunused-variable] 27 | NvtxParamsReduceScatter payload{recvcount * ncclTypeSize(datatype), op}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/reduce_scatter.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings generated when compiling for 
gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings 
generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | 
^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 
warnings generated when compiling for gfx906. 4 warnings generated when compiling for gfx900. 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx1100. 4 warnings generated when compiling for gfx908. 4 warnings generated when compiling for gfx803. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx940. 4 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static 
ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/enqueue.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/msccl.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t 
mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx942. 4 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 42%] Building CXX object CMakeFiles/rccl.dir/hipify/src/channel.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/channel.cc.o -MF CMakeFiles/rccl.dir/hipify/src/channel.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/channel.cc.o 
-c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' 
[-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | statiIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ c gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 
206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static 
ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | 
^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: 
unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 8 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t 
ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx90a. 
8 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' 
[-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/channel.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 8 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused 
function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rings.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' 
[-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 
'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t 
ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:104:12: warning: unused variable 'y' [-Wunused-variable] 104 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:111:7: warning: unused variable 'localRanks' [-Wunused-variable] 111 | int localRanks = comm->topo->nodes[GPU].count; | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' 
[-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/connect.cc:230:21: warning: unused function 'getIndexes' [-Wunused-function] 230 | static ncclResult_t getIndexes(int* ranks, int* indexes, int nNodes) { | ^~~~~~~~~~ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/trees.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 
-DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused 
variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | 
double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' 
[-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 
'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static 
ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: 
unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' 
[-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* 
index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) 
{ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' 
[-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx906. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx940. 9 warnings generated when compiling for gfx908. 9 warnings generated when compiling for gfx1030. 9 warnings generated when compiling for gfx803. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx90a. 
9 warnings generated when compiling for gfx941. 9 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx942. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:338:10: warning: unused variable 'llMaxBw' [-Wunused-variable] 338 | double llMaxBw = llMaxBws[index1][index2]; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:339:10: warning: unused variable 'perChMaxTreeBw' [-Wunused-variable] 339 | double perChMaxTreeBw = perChMaxTreeBws[compCapIndex][index2]; | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:340:10: warning: unused variable 'perChMaxRingLL128Bw' [-Wunused-variable] 340 | double perChMaxRingLL128Bw = perChMaxRingLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:341:10: warning: unused variable 'perChMaxTreeLL128Bw' [-Wunused-variable] 341 | double perChMaxTreeLL128Bw = perChMaxTreeLL128Bws[compCapIndex][index2]; | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:344:9: warning: unused variable 'ppn' [-Wunused-variable] 344 | float ppn = (float)nRanks / nNodes; // if ppn < 2, then we are sending/receiving at the same GPU through the NIC, apply some bw discount | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/tuning.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | 
static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/group.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 
--offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/group.cc.o -MF CMakeFiles/rccl.dir/hipify/src/group.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/group.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | 
^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t 
ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx906. 2 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx900. 2 warnings generated when compiling for gfx1030. 2 warnings generated when compiling for gfx803. 2 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx90a. 2 warnings generated when compiling for gfx940. 2 warnings generated when compiling for gfx1101. 2 warnings generated when compiling for gfx1100. 2 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for host. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/group.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/group.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 2 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct 
ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | 
int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t 
xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float 
value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used 
[-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 
101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const 
char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* 
attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: 
warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' 
[-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* 
system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 
'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | 
static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 
'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct 
ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static 
long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused 
function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* 
attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ rIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 
'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 20 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused 
function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: 
warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: 
warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, 
struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 
'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t 
xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 20 warnings generated when compiling for gfx90a. 
20 warnings generated when compiling for gfx803. 20 warnings generated when compiling for gfx1102. 20 warnings generated when compiling for gfx1101. 20 warnings generated when compiling for gfx90a. 20 warnings generated when compiling for gfx941. 20 warnings generated when compiling for gfx906. 20 warnings generated when compiling for gfx900. 20 warnings generated when compiling for gfx1100. 20 warnings generated when compiling for gfx940. 20 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' 
[-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:270:7: warning: variable 'intermediateIndex' set but not used [-Wunused-but-set-variable] 270 | int intermediateIndex = -1; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/paths.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: 
unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t 
xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 20 warnings generated when compiling for gfx942. 20 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/archinfo.cc gmake[2]: Leaving directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 44%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/onerank_reduce.cu.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE 
-DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | 
^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906.
1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/argcheck.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/argcheck.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function
'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: 
unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char 
ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used 
[-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float 
t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = 
(tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' 
[-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct 
ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t 
xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | 
^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: 
warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, 
struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const 
char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; 
| ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused 
function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct 
ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 
1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 
1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* 
attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: 
warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, 
const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* 
attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: 
warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 
'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ 28 warnings generated when compiling for gfx803. 28 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct 28 warnings generated when compiling for gfx900. 
ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ 28 warnings generated when compiling for gfx1100. 28 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ 28 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct 
ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t 
xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx1030. 28 warnings generated when compiling for gfx908. 28 warnings generated when compiling for gfx1101. 28 warnings generated when compiling for gfx1102. 28 warnings generated when compiling for gfx940. 28 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t 
xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) 
{ | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' 
[-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx942. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:773:7: warning: unused variable 'nChannels' [-Wunused-variable] 773 | int nChannels = 0; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:783:12: warning: unused variable 'y' [-Wunused-variable] 783 | int x=0, y=0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1238:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1238 | static char ringRemap[64]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1242:7: warning: unused variable 'ncpus' [-Wunused-variable] 1242 | int ncpus = system->nodes[CPU].count; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1327:9: warning: unused variable 't' [-Wunused-variable] 1327 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1364:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1364 | static char ringRemap[256]; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1385:7: warning: variable 'gcnt' set but not used [-Wunused-but-set-variable] 1385 | int gcnt = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1460:9: warning: unused variable 't' [-Wunused-variable] 1460 | float t = (tve.tv_sec - tvs.tv_sec)*1E3 + (tve.tv_usec - tvs.tv_usec)/1E3; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:1499:15: warning: unused variable 'ringRemap' [-Wunused-variable] 1499 | static char ringRemap[256]; | ^~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:22: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:24: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/rome_models.cc:25: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* 
value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused 
function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 28 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, 
const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 
0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, 
const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, 
supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } 
| ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' 
[-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, 
supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } 
| ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static 
ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct 
ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' 
[-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 
22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct 
ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) 
{ | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t 
collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { 
NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 
'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 
'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(structIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static
ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSe ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct 
ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const chtAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ ar* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 
| static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, 
listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** 
request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t 
xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used 
[-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ 23 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, 
rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, 
void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t 
xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 23 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, 
const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 23 warnings generated when compiling for gfx1100. 23 warnings generated when compiling for gfx941. 23 warnings generated when compiling for gfx906. 23 warnings generated when compiling for gfx1030. 23 warnings generated when compiling for gfx90a. 23 warnings generated when compiling for gfx803. 23 warnings generated when compiling for gfx90a. 
23 warnings generated when compiling for gfx900. 23 warnings generated when compiling for gfx1102. 23 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 
'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' 
[-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** 
node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:865:19: warning: 
variable 'cId' set but not used [-Wunused-but-set-variable] 865 | int gIndex = 0, cId = 0, n = 0; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, 
listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** 
request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t 
xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 23 warnings generated when compiling for gfx942. 23 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 47%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: 
unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 
'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 
'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t 
xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, 
const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused 
function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' 
[-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' 
[-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, 
const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const 
char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 10 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, 
const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 10 warnings generated when compiling for gfx941. 10 warnings generated when compiling for gfx1102. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx900. 10 warnings generated when compiling for gfx1101. 10 warnings generated when compiling for gfx906. 10 warnings generated when compiling for gfx1100. 10 warnings generated when compiling for gfx803. 10 warnings generated when compiling for gfx90a. 10 warnings generated when compiling for gfx1030. 10 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, 
const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 10 warnings generated when compiling for gfx942. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/search.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, 
const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ 10 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 48%] Building CXX object CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -MF CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, 
xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | 
^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: 
expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static 
ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct 
ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t 
xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 8 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:7: warning: variable 'devIndex' is used uninitialized whenever 'if' condition is false [-Wsometimes-uninitialized] 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:798:41: note: uninitialized use occurs here 798 | NCCLCHECK(ncclTopoGetXmlFromGpu(node, devIndex, xml, gpuNode)); | ^~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/checks.h:108:22: note: expanded from macro 'NCCLCHECK' 108 | ncclResult_t RES = call; \ | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:795:3: note: remove the 'if' if its condition is always true 795 | if (rocmsmiInit == 1) { | ^~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:790:20: note: initialize the variable 'devIndex' to silence this warning 790 | uint32_t devIndex; | ^ | = 0 In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static 
ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.cc:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const 
char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 8 warnings generated when compiling for gfx942. 8 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx940. 
1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvsymbols.cc:64: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ipcsocket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | 
^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. 
1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/ibvwrap.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/ibvwrap.h:21: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/nvmlwrap_stub.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL 
-DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' 
[-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx1100. 3 warnings generated when compiling for gfx900. 
3 warnings generated when compiling for gfx1030. 3 warnings generated when compiling for gfx908. 3 warnings generated when compiling for gfx940. 3 warnings generated when compiling for gfx906. 3 warnings generated when compiling for gfx1101. 3 warnings generated when compiling for gfx90a. 3 warnings generated when compiling for gfx803. 3 warnings generated when compiling for gfx1102. 3 warnings generated when compiling for gfx941. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:711:16: warning: unused variable 'ret' [-Wunused-variable] 711 | ncclResult_t ret = ncclSuccess; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:723:16: warning: unused variable 'ret' [-Wunused-variable] 723 | ncclResult_t ret = ncclSuccess; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_parser.cc:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 3 warnings generated when compiling for gfx942. 3 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 
1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocm_smi_wrap.cc:23: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_status.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL 
-DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/param.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives 
-I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/rocmwrap.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 52%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- 
--offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: 
warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long 
log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/profiler.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/profiler.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/proxy.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 49%] Building CXX object CMakeFiles/rccl.dir/hipify/src/init.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/init.cc.o -MF CMakeFiles/rccl.dir/hipify/src/init.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/init.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank 
payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 
'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t 
collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 
'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t 
collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int 
dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | 
static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 
'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, 
int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, 
done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused 
function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t 
xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t 
CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t 
collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~
collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: 
warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ :21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 
| static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: 
warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, 
int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, 
done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused 
function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t 
xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t 
collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct
ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | 
static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { |
^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* 
collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { 
NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | 
static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t 
collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { 
NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: 
warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t 
xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 
'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | 
static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ 45 warnings generated when compiling for gfx900.
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' 
[-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ 45 warnings generated when compiling for gfx90a.
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' 
[-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* 
collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused 
function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | 
^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ 45 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static 
ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t 
collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | 
static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct 
ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ 45 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t 
xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for gfx940. 45 warnings generated when compiling for gfx1100. 45 warnings generated when compiling for gfx1030. 45 warnings generated when compiling for gfx1101. 45 warnings generated when compiling for gfx90a. 45 warnings generated when compiling for gfx803. 
45 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: 
warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: 
warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const 
char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t 
collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1675:11: warning: unused variable 'stackSize' [-Wunused-variable] 1675 | int64_t stackSize; | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1676:19: warning: unused variable 'devProp' [-Wunused-variable] 1676 | hipDeviceProp_t devProp; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1995:26: warning: unused variable 'payload' [-Wunused-variable] 1995 | NvtxParamsCommInitRank payload{myrank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2009:38: warning: unused variable 'CommInitAllSchema' [-Wunused-variable] 2009 | constexpr nvtxPayloadSchemaEntry_t CommInitAllSchema[] = { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2311:26: warning: unused variable 'payload' [-Wunused-variable] 2311 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:2341:26: warning: unused variable 'payload' [-Wunused-variable] 2341 | NvtxParamsCommInitRank payload{rank, nranks, cudaDev}; | ^~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:37: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:38: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: 
warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:782:21: warning: unused function 'collNetTrySetup' [-Wunused-function] 782 | static ncclResult_t collNetTrySetup(ncclComm_t comm, ncclComm_t parent, struct ncclTopoGraph* collNetGraph) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/init.cc:1980:36: warning: unused variable 'CommInitRankSchema' [-Wunused-const-variable] 1980 | constexpr nvtxPayloadSchemaEntry_t CommInitRankSchema[] = { | ^~~~~~~~~~~~~~~~~~ 45 warnings generated when compiling for gfx942. 45 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/npkit.cc:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/shmutils.cc:17: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 53%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/signals.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 46%] Building CXX object CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -MF CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | 
gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused 
variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation 
of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 
'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 
'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | 
^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' 
[-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' 
[-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: 
suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, 
workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static 
ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, 
size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float 
ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: 
warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int 
type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, 
done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 
'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: 
unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of 
function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); 
return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } 
| ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* 
comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: 
unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { 
NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel<false>(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct 
ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' 
[-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* 
props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { 
NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t 
ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int 
rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, 
int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | 
static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { 
NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { 
NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; 
} | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | 
static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct 
ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 28 warnings generated when compiling for gfx90a. 28 warnings generated when compiling for gfx90a. 28 warnings generated when compiling for gfx906. 28 warnings generated when compiling for gfx1102. 28 warnings generated when compiling for gfx908. 28 warnings generated when compiling for gfx941. 
28 warnings generated when compiling for gfx1101. 28 warnings generated when compiling for gfx900. 28 warnings generated when compiling for gfx803. 28 warnings generated when compiling for gfx1100. 28 warnings generated when compiling for gfx1030. 28 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' 
[-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:330:34: warning: suggest braces around initialization of subobject [-Wmissing-braces] 330 | struct ncclWorkElemP2p elem = {0}; | ^ | {} In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int 
rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, 
int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:20: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:368:3: note: expanded from macro 'IMPL_MAIN_KERN' 368 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:294:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 294 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:32:1: note: in instantiation of function template specialization 'ncclKernel' requested here 32 | IMPL_MAIN_KERN(); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:373:3: note: expanded from macro 'IMPL_MAIN_KERN' 373 | ncclKernel(comm, channelMask, workHead); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:298:34: note: uninitialized use occurs here 298 | copyToShmem16(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:271:14: note: initialize the variable 'dst' to silence this warning 271 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:10: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:16:20: warning: unused function 'collNetName' [-Wunused-function] 16 | 
static const char* collNetName(struct ncclComm* comm) { return comm->ncclCollNet->name; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/enqueue.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:206:21: warning: unused function 'ncclGdrCudaFree' [-Wunused-function] 206 | static ncclResult_t ncclGdrCudaFree(void* gdrHandle) { | ^~~~~~~~~~~~~~~ 28 warnings generated when compiling for host. 28 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 51%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' 
[-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status 
= mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' 
[-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx941. 4 warnings generated when compiling for gfx1030. 4 warnings generated when compiling for gfx940. 4 warnings generated when compiling for gfx1101. 4 warnings generated when compiling for gfx1102. 4 warnings generated when compiling for gfx803. 4 warnings generated when compiling for gfx90a. 4 warnings generated when compiling for gfx906. 4 warnings generated when compiling for gfx900. 4 warnings generated when compiling for gfx90a. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:76:16: warning: unused variable 'status' [-Wunused-variable] 
76 | mscclStatus& status = mscclGetStatus(); | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:132:27: warning: unused variable 'threadLocalStatus' [-Wunused-variable] 132 | mscclThreadLocalStatus& threadLocalStatus = mscclGetThreadLocalStatus(); | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_setup.cc:6: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/channel.h:41:21: warning: unused function 'ncclChannelCompute' [-Wunused-function] 41 | static ncclResult_t ncclChannelCompute(struct ncclComm* comm, int peer, int channelInc, int coll, int*channelId) { | ^~~~~~~~~~~~~~~~~~ 4 warnings generated when compiling for gfx942. 4 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/strongstream.cc gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 50%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL 
-DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t 
ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' 
[-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, 
const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { 
| ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 
'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' 
[-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: 
warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t 
mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | 
^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { 
| ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 
'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' 
[-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: 
warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t 
mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx940. 9 warnings generated when compiling for gfx803. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx908. 9 warnings generated when compiling for gfx941. 9 warnings generated when compiling for gfx906. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1030. 9 warnings generated when compiling for gfx900. 9 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:16: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:18: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused 
function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:21: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:75:21: warning: unused function 'mscclXmlGetAttrInt' [-Wunused-function] 75 | static ncclResult_t mscclXmlGetAttrInt(struct mscclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:82:21: warning: unused function 'mscclXmlGetAttrInt64' [-Wunused-function] 82 | static ncclResult_t mscclXmlGetAttrInt64(struct mscclXmlNode* node, const char* attrName, int64_t* value) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/msccl/msccl_parser.h:89:21: warning: unused function 'mscclXmlFindTag' [-Wunused-function] 89 | static ncclResult_t mscclXmlFindTag(struct mscclXml* xml, const char* tagName, struct mscclXmlNode** node) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/msccl/msccl_lifecycle.cc:27:20: warning: unused variable 'mscclAlgoFilePathEnv' [-Wunused-variable] 27 | static const char* mscclAlgoFilePathEnv = "MSCCL_ALGO_FILE_PATH"; | ^~~~~~~~~~~~~~~~~~~~ 9 warnings generated when compiling for gfx942. 9 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/utils.cc:7: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 
-MD -MT CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/misc/socket.cc:8: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 54%] Building CXX object CMakeFiles/rccl.dir/hipify/src/net.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/net.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx906. 
1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/net.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: 
warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 1 warning generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx90a. 
1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_socket.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long 
log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long 
log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport.cc:8: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- 
--offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 
40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1102. 
1 warning generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/nvls.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | 
^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float 
ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 
229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' 
[-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 
'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: 
unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int 
dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct 
ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' 
[-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx906. 5 warnings generated when compiling for gfx941. 5 warnings generated when compiling for gfx90a. 5 warnings generated when compiling for gfx803. 5 warnings generated when compiling for gfx1100. 5 warnings generated when compiling for gfx90a. 5 warnings generated when compiling for gfx908. 5 warnings generated when compiling for gfx1030. 5 warnings generated when compiling for gfx940. 5 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t 
ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/p2p.cc:13: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 
'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ 5 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/proxy.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip 
--offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -MF CMakeFiles/rccl.dir/hipify/src/proxy.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx908. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx900. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/proxy.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. 1 warning generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 56%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: 
warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, 
const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 
'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttrIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | 
^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ (struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 
207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 
'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, 
const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: 
warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static 
ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: 
warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct 
ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | 
static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct 
ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 
'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* 
value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx908. 
17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const 
char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net_ib.cc:28: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:94:21: 
warning: unused function 'xmlGetAttrInt' [-Wunused-function] 94 | static ncclResult_t xmlGetAttrInt(struct ncclXmlNode* node, const char* attrName, int* value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:101:21: warning: unused function 'xmlGetAttrIntDefault' [-Wunused-function] 101 | static ncclResult_t xmlGetAttrIntDefault(struct ncclXmlNode* node, const char* attrName, int* value, int defaultValue) { | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:109:21: warning: unused function 'xmlGetAttrFloat' [-Wunused-function] 109 | static ncclResult_t xmlGetAttrFloat(struct ncclXmlNode* node, const char* attrName, float* value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:116:21: warning: unused function 'xmlFindTag' [-Wunused-function] 116 | static ncclResult_t xmlFindTag(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:128:21: warning: unused function 'xmlFindTagKv' [-Wunused-function] 128 | static ncclResult_t xmlFindTagKv(struct ncclXml* xml, const char* tagName, struct ncclXmlNode** node, const char* attrName, const char* attrValue) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:144:21: warning: unused function 'xmlSetAttr' [-Wunused-function] 144 | static ncclResult_t xmlSetAttr(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:157:21: warning: unused function 'xmlSetAttrIfUnset' [-Wunused-function] 157 | static ncclResult_t xmlSetAttrIfUnset(struct ncclXmlNode* node, const char* attrName, const char* value) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:169:21: warning: unused function 'xmlSetAttrInt' [-Wunused-function] 169 | static 
ncclResult_t xmlSetAttrInt(struct ncclXmlNode* node, const char* attrName, const int value) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:182:21: warning: unused function 'xmlSetAttrFloat' [-Wunused-function] 182 | static ncclResult_t xmlSetAttrFloat(struct ncclXmlNode* node, const char* attrName, const float value) { | ^~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:195:21: warning: unused function 'xmlUnsetAttr' [-Wunused-function] 195 | static ncclResult_t xmlUnsetAttr(struct ncclXmlNode* node, const char* attrName) { | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:207:21: warning: unused function 'xmlGetSub' [-Wunused-function] 207 | static ncclResult_t xmlGetSub(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:233:21: warning: unused function 'xmlGetSubKvInt' [-Wunused-function] 233 | static ncclResult_t xmlGetSubKvInt(struct ncclXmlNode* node, const char* subName, struct ncclXmlNode** sub, const char* attrName, const int attrValue) { | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:240:21: warning: unused function 'xmlAddNode' [-Wunused-function] 240 | static ncclResult_t xmlAddNode(struct ncclXml* xml, struct ncclXmlNode* parent, const char* subName, struct ncclXmlNode** sub) { | ^~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:256:21: warning: unused function 'xmlRemoveNode' [-Wunused-function] 256 | static ncclResult_t xmlRemoveNode(struct ncclXmlNode* node) { | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:276:21: warning: unused function 'kvConvertToInt' [-Wunused-function] 276 | static ncclResult_t kvConvertToInt(const char* str, int* value, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/xml.h:289:21: warning: unused function 'kvConvertToStr' [-Wunused-function] 289 | static ncclResult_t kvConvertToStr(int value, const char** str, struct kvDict* dict) { | ^~~~~~~~~~~~~~ 17 warnings generated when compiling for gfx942. 17 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static 
long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx803. 1 warning generated when compiling for gfx906. 1 warning generated when compiling for gfx1102. 1 warning generated when compiling for gfx941. 1 warning generated when compiling for gfx940. 1 warning generated when compiling for gfx1030. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx1100. 1 warning generated when compiling for gfx1101. 1 warning generated when compiling for gfx90a. 1 warning generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for gfx942. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/shm.cc:7: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ 1 warning generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:187:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 187 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllGather, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:187:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoLL128>' requested here 187 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllGather, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -MF 
CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t 
collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t 
collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' 
[-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, 
void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | 
^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { 
NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: 
warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t 
collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* 
collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused 
variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ 21 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return 
ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | 
^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* 
comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connecIn file included from tMap*/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc :m8a: pIn file included from )/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h :{11 : In file included from | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h ^~~~~~~~~~~~~~: 12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: 
warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static 
ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloIn file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_tIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct 
ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int 
size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' 
[-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, 
void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 21 warnings generated when compiling for gfx906. 
28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 
1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' 
[-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, 
void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | 
^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' 
[-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | 
^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ 21 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return 
ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | 
^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* 
comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ 21 warnings generated when compiling for gfx908. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx1101. 21 warnings generated when compiling for gfx90a. 21 warnings generated when compiling for gfx900. 21 warnings generated when compiling for gfx803. 21 warnings generated when compiling for gfx1102. 21 warnings generated when compiling for gfx1100. 21 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* 
comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | 
^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused 
variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:8: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:9: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:17:21: warning: unused function 'collNetDevices' [-Wunused-function] 17 | static ncclResult_t collNetDevices(struct ncclComm* comm, int* ndev) { NCCLCHECK(comm->ncclCollNet->devices(ndev)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:18:21: warning: unused function 'collNetGetProperties' [-Wunused-function] 18 | static ncclResult_t collNetGetProperties(struct ncclComm* comm, int dev, ncclNetProperties_t* props) { NCCLCHECK(comm->ncclCollNet->getProperties(dev, props)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:19:21: warning: unused function 'collNetListen' [-Wunused-function] 19 | static ncclResult_t collNetListen(struct ncclComm* comm, int dev, void* handle, void** listenComm) { NCCLCHECK(comm->ncclCollNet->listen(dev, handle, listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:20:21: warning: unused function 'collNetConnect' [-Wunused-function] 20 | static ncclResult_t collNetConnect(struct ncclComm* comm, void* handles[], int nranks, int rank, void* listenComm, void** collComm) { NCCLCHECK(comm->ncclCollNet->connect(handles, nranks, rank, listenComm, collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:21:21: warning: unused function 'collNetReduceSupport' [-Wunused-function] 21 | static ncclResult_t collNetReduceSupport(struct ncclComm* comm, ncclDataType_t dataType, ncclRedOp_t redOp, int* supported) { NCCLCHECK(comm->ncclCollNet->reduceSupport(dataType, redOp, supported)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:22:21: warning: unused function 'collNetRegMr' [-Wunused-function] 22 | static ncclResult_t collNetRegMr(struct ncclComm* comm, void* collComm, void* data, int size, int type, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMr(collComm, data, size, type, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:24:21: warning: unused function 'collNetRegMrDmaBuf' [-Wunused-function] 24 | static ncclResult_t collNetRegMrDmaBuf(struct ncclComm* comm, void* collComm, void* data, int size, int type, uint64_t offset, int fd, void** mhandle) { NCCLCHECK(comm->ncclCollNet->regMrDmaBuf(collComm, data, size, type, offset, fd, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:25:21: warning: unused function 'collNetDeregMr' [-Wunused-function] 25 | static ncclResult_t collNetDeregMr(struct ncclComm* comm, void* collComm, void* mhandle) { NCCLCHECK(comm->ncclCollNet->deregMr(collComm, mhandle)); return ncclSuccess; } | ^~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:26:21: warning: unused function 'collNetIallreduce' [-Wunused-function] 26 | static ncclResult_t collNetIallreduce(struct ncclComm* comm, void* collComm, void* sendData, void* recvData, int count, ncclDataType_t dataType, ncclRedOp_t redOp, void* sendMhandle, void* recvMhandle, void** request) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:28:21: warning: unused function 'collNetIflush' [-Wunused-function] 28 | static ncclResult_t collNetIflush(struct ncclComm* comm, void* collComm, void* data, int size, void* mhandle, void** request) { NCCLCHECK(comm->ncclCollNet->iflush(collComm, data, size, mhandle, request)); return ncclSuccess; } | ^~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:29:21: warning: unused function 'collNetTest' [-Wunused-function] 29 | static ncclResult_t collNetTest(struct ncclComm* comm, void* request, int* done, int* size) { NCCLCHECK(comm->ncclCollNet->test(request, done, size)); return ncclSuccess; } | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:30:21: warning: unused function 'collNetCloseColl' [-Wunused-function] 30 | static ncclResult_t collNetCloseColl(struct ncclComm* comm, void* collComm) { NCCLCHECK(comm->ncclCollNet->closeColl(collComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:31:21: warning: unused function 'collNetCloseListen' [-Wunused-function] 31 | static ncclResult_t collNetCloseListen(struct 
ncclComm* comm, void* listenComm) { NCCLCHECK(comm->ncclCollNet->closeListen(listenComm)); return ncclSuccess; } | ^~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/coll_net.h:33:12: warning: unused function 'collNetSupport' [-Wunused-function] 33 | static int collNetSupport(struct ncclComm* comm) { return comm->ncclCollNet != nullptr ? 1 : 0; } | ^~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:12: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/coll_net.cc:196:21: warning: unused function 'collNetDumpMap' [-Wunused-function] 196 | static ncclResult_t collNetDumpMap(struct connectMap* map) { | ^~~~~~~~~~~~~~ 21 warnings generated when compiling for gfx942. 21 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 57%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | 
int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 55%] Building CXX object CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o -MF CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o.d -o CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' 
[-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: 
unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: 
unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused 
function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx906. 11 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t 
ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx900. 11 warnings generated when compiling for gfx1100. 11 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t 
ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx1101. 11 warnings generated when compiling for gfx1102. 11 warnings generated when compiling for gfx1030. 11 warnings generated when compiling for gfx941. 11 warnings generated when compiling for gfx908. 11 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:168:14: warning: unused variable 'info' [-Wunused-variable] 168 | gdr_info_t info; | ^~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:170:12: warning: unused variable 
'mh' [-Wunused-variable] 170 | gdr_mh_t mh; | ^~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:172:9: warning: unused variable 'gdrMap' [-Wunused-variable] 172 | void *gdrMap; | ^~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:201:19: warning: unused variable 'md' [-Wunused-variable] 201 | gdr_mem_desc_t *md = (gdr_mem_desc_t*)gdrHandle; | ^~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/comm.h:11: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/transport.h:12: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/graph.h:124: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/info.h:14: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/core.h:60: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/alloc.h:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/utils.h:40:13: warning: unused function 'log2i' [-Wunused-function] 40 | static long log2i(long n) { | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:14: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include/gdrwrap.h:161:14: warning: unused function 'ncclGdrInit' [-Wunused-function] 161 | static gdr_t ncclGdrInit() { | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:19: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:195:21: warning: unused function 'ncclTopoIdToIndex' [-Wunused-function] 195 | static ncclResult_t ncclTopoIdToIndex(struct ncclTopoSystem* system, int type, int64_t id, int* index) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:206:21: warning: unused function 'ncclTopoRankToIndex' [-Wunused-function] 206 | static ncclResult_t ncclTopoRankToIndex(struct ncclTopoSystem* system, int rank, int* index) { | ^~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:217:21: warning: unused function 'ncclTopoDevToRank' [-Wunused-function] 217 | static ncclResult_t ncclTopoDevToRank(struct ncclTopoSystem* system, int dev, int* rank) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/graph/topo.h:229:14: warning: unused function 'ncclTopoXGMISpeed' [-Wunused-function] 229 | static float ncclTopoXGMISpeed(const char* gcn) { | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/transport/net.cc:265:21: warning: unused function 'netDumpMap' [-Wunused-function] 265 | static ncclResult_t netDumpMap(struct connectMap* map) { | ^~~~~~~~~~ 11 warnings generated when compiling for gfx942. 11 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | 
Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:60:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 60 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_gather.h:173:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 173 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllGather, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when 
compiling for host.
8 warnings generated when compiling for gfx942.
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
[ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o
/usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp
In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | 
runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 
| flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, 
SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | 
runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in
instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit<T, RedOp, ProtoLL128>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit<T, RedOp, ProtoLL128>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads),
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 
'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, 
ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, 
tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | 
warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 25 warnings generated when compiling for gfx90a. 25 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested 
here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation 
of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, 
args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), 
| ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, 
args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested 
here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | 
prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE 
flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 
1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, 
args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), 
| ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: 
in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 
| runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of 
member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncSum<__half>, ProtoLL128>' requested here 994 | 
runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: 
note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 
0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested 
here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation 
of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, 
tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested 
here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation 
of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 
'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMin<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, 
args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 
'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) 
{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member 
function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 
'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), 
| ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 
'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, 
&tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 
0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' 
will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncProd<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 
'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | 
runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of 
member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, 
FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested 
here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation 
of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, 
args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 
'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(argIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ s); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: 
note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, 
FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 
0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101.
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) 
{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncMax<__half>, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | 
IMPL_COLL_FUNC(AllReduce, TREE, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 60%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 
-mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' 
set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: 
unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' 
requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: 
note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, 
tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | 
warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: 
field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 
0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, 
nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit<__half, FuncPreMulSum, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | 
warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:451:9: note: in instantiation of member function 'Primitives, FanSymmetric<2>, 0, ProtoLL128, 0>::Primitives' requested here 451 | prims(tid, nthreads, tree->down, tree->down, args->sendbuff, args->recvbuff, args->redOpArg); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after 
field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:491:9: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoLL128, 0>::Primitives' requested here 491 | prims(tid, nthreadsSplit, tree->down, &tree->up, args->sendbuff, args->recvbuff, args->redOpArg, 0*Proto::MaxGroupWidth); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:533:9: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoLL128, 0>::Primitives' requested here 533 | prims(tid-nthreadsSplit, nthreads-nthreadsSplit, &tree->up, tree->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:994:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeSplit, ProtoLL128>' requested here 994 | runTreeSplit(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 0, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, TREE, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 37 warnings generated when compiling for gfx90a. 37 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: 
warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 58%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 59%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 
--offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 
0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member 
function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:5:1: 
note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | 
warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:9:1: note: in 
instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:13:1: note: in 
instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o.d -o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 63%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: 
warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* 
ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation 
of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: 
warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' 
will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().rIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ un(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadId7 warnings generated when compiling for gfx940. x.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE 
flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:14:1: note: in 
instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, 
FuncSum<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:13:1: note: in 
instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 64%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o.d -o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: 
note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:12:1: note: in 
instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member 
function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmeIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ m.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE 
flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:12:1: note: in 
instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but 
not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 
| uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | 
warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' 
requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 65%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; 
| ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* 
ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:6:1: note: in 
instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:10:1: note: 
in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 
| runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:7:1: note: 
in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation 
of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | 
warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 
507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member 
function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after 
field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:986:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 986 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:14:1: note: 
in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(AllReduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | 
int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 68%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable]In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 69%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 
'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h): 386{: 9 :| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] | group(group 386 | 
int/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :w275i:r90e:O fnote: fin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres et = W i275r | e W o r d P ePrrSilmiictei*vweasr, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in 
instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 562 | : 562t:i15d:( twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d ), nthreads(nthr e562a | d s ) , ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u pt(igdrIonuBpl)o,c k (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d Idx. x563) | , g r osutpe(pgSriozuep()n,c c l| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h m e| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T). comm. b563u | f f S i zsetse[pNSCiCzLe_(PnRcOcTlOS_hSmIeMmP.LcEo]m/mN.CbCuLf_fSSTiEzPeSs/[sNiCzCeLo_fP(RTO)T)O _{S I M| P ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L E ]| / group(groupN CCL_STEPS/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hz:e324o:f90(:T )note: )in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 324 | | group(group Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereF anAsy m275m | e t r i c < 1P,r iNmCiCtLi_vMeAsX<_TD,E VR_eAdROIpT,Y >F,a n/A*sDyimrmeecttr=i*c/<0N,C CPLr_oMtAoX,_ D0E>V _pArRiImTsY , | 1 ^> , /*Di/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:e595c:t5=:* /note: 0in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here, Pro t595o | , 0 > rpurniTmrse e U| p ^D own, ProtoSimple<1, 1>>' requested hereP rot o595S | i m p l ere>U(paDrogwsn)<;T , | R ^e dOp, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:t202o:S53i:m 
pnote: lin instantiation of member function 'RunWorkElement, 0, 2>::run' requested heree <1, 1202> | > ( a r g s ) ; R u| n ^W orkElement, 0, 2>::run' requested herep , Al g202o | , P r o t o > (R)u.nrWuonr(kwEel)e;m e n| t ^< Fn, T,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp :R4e:d1O:p ,note: in instantiation of member function 'RunWork, 0, 2>::run' requested hereA lgo, 4P | rIoMtPoL>_(C)O.LrLu_nF(UwNeC)(;A l l| R ^e duce, T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cppR:E5E:,1 :S Inote: Min instantiation of member function 'RunWork, 0, 2>::run' requested hereP LE, S5u | mIPMoPsLt_DCiOvL,L _iFnUtN8C_(tA)l l R| e^d uce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :T391R:E95E:, note: Sexpanded from macro 'IMPL_COLL_FUNC'I MPLE, S391u | m P oRsutnDWiovr,k <,n cNcClCFLu_nAcL#G#Of_u#n#ca,l gtoy,p eN,C CFLu_nPcR#O#TdOe_v#r#epdroopty(p)e.>r,u nN(C&CnLc_cAlLSGhOm_e#m#.awlogrok,) ;N C\C L _| P ^R OTO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:r562o:t15o:> (note: )field 'nthreads' will be initialized after field 'tidInBlock'. 
run(& n562c | c l S h mteimd.(wtoirdk)),; n\t h r| e ^a ds(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:) ,note: field 'nthreads' will be initialized after field 'tidInBlock't idInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ ti d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562B:l60o: c 7knote: | (field 'group' will be initialized after field 'stepSize'It MhPrLe_ aC562dO | IL dL x_ .F xUt)Ni,Cd ((gAtrliolduR)pe,(d gunrctoehu,rp e)Ta,Rd Es E(| ,n ^~~~~~~~~~~~~~~~~ t ShIrM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.heP:aL562dE:s,60) :,S unote: tmfield 'group' will be initialized after field 'stepSize'iP doIsn tB562Dl | io vc ,k ( uttihinrdte(3at2di_Idtd))x, . xn| )t^,h rgeraod/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hus:p(391(n:gt95rh:or uenote: paexpanded from macro 'IMPL_COLL_FUNC')d ,s ) ,| ^~~~~~~~~~~391t | i d IRnuBnlWoocrkk(, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will 
be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ izes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSiMAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ze(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ V_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 
| runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member 
function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will 
be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 
1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 
1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' 
requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 
1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, 
Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ TO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hid:)562,: 15n:t hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e ads(nthreads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o c k| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t hread I563d | x . x ) ,s tgerpoSuipz(eg(rnocucpl)S,h m e| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~. c o| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)m .buff S563i | z e s [ NsCtCeLp_SPiRzOeT(On_cScIlMSPhLmEe]m/.NcCoCmLm_.SbTuEfPfSS/isziezse[oNfC(CTL)_)P R{O T O| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S I M| P group(groupL E]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hS:T324E:P90S:/ snote: iin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herez eof(T )324) | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ P r| i group(groupm itives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec <1, N C275C | L _ M A X _ DPErVi_mAiRtIiTvYe>s,< T/,* DRierdeOcpt,= *F/a0n,A sPyrmomteot,r i0c>< NpCrCiLm_sM A X| _ ^D EV_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hR:I595T:Y5,: 1note: >in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here, /*D i595r | e c t = *r/u0n,T rPereoUtpoD,o w0n>< Tp,r iRmesd O p| , ^ ProtoSimp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:e595<:15,: 1note: >in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here> (args )595; | | ^ runTr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:e202U:p53D:o wnote: nin instantiation of member function 'RunWorkElement, 0, 2>::run' requested here< T, R202e | d O p , P r o tRouSniWmoprlkeE>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx941. 19 warnings generated when compiling for gfx908. 19 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 19 warnings generated when compiling for gfx90a. 19 warnings generated when compiling for gfx803. 19 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx90a. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx1100. 19 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member 
function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx900. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: 
warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 
| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for gfx1102. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 19 warnings generated when compiling for host. 19 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused 
variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: 
unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 70%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; 
| ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 
'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitivest,i d/(*tDiidr)e,c tn=t*h/r0e,a dPsr(onttoh,r e0a>d sp)r,i mtsi d I| n ^B lock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h595r:e5a:d Inote: din instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herex .x), 595g | r o u p (rgurnoTurpe)e,U p D| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~w n <| T tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), RedO p563, | P r o tsotSeipmSpilzee<(1n,c c1l>S>h(maermg.sc)o;m m .| b ^u ffSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:[202N:C53C:L _note: Pin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereR OTO_ S202I | M P L E ] / N C CRLu_nSWToErPkSE/lseimzeenotf<(FTn),) T{, R| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d O p| , group(group Algo, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:o275t:o90>:( )note: .in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer un(we) ;275 | | ^ Primitives, 0, 2>::run' requested here, Fan A4s | yImMmPeLt_rCiOcL,, S/I*MDPiLrEe,c tP=r*o/d0,, iPnrto8t_ot,) 0 >| ^p rims /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391 ^: 95: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hexpanded from macro 'IMPL_COLL_FUNC': 595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 391 | R595u | n W o r krp>e(>a,r gNsC)C;L _ A| L ^G O_##al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:o202,: 53N:C Cnote: Lin instantiation of member 
function 'RunWorkElement, 0, 2>::run' requested here_ PROT O202_ | # # p r o t o > (R)u.nrWuonr(k&EnlcecmleSnhtm (note: )field 'nthreads' will be initialized after field 'tidInBlock'. run(w e562) | ; | ^t id(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cppd:)5,: 1n:t hnote: rin instantiation of member function 'RunWork, 0, 2>::run' requested heree ads( n5t | hIrMePaLd_sC)O,L Lt_iFdUINnCB(lAolclkR(etdhurceea,d ITdRxE.Ex,) ,S IgMrPoLuEp,( gPrrooudp,) ,u i n| t ^~~~~~~~~~~~~~~~~8 _t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h): 562 :| 60^: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 391562: | 95 : note: expanded from macro 'IMPL_COLL_FUNC't id(tid) ,391 | n t hRruenaWdosr(knr,o uNpC)C,L _ A| L ^~~~~~~~~~~G O_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 
'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 
5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /562* | D i r e ctti=d*(/t0i,d )P,r onttoh,r e0a>d sp(rnitmhsr e a| d ^s ), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d595I:n5B:l onote: cin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herek (thr e595a | d I d x .rxu)n,T rgereoUuppD(ogwrnop>S(iazreg(sn)c;c l S| h ^m em.co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:m202.:b53u:f fnote: Sin instantiation of member function 'RunWorkElement, 0, 2>::run' requested herei zes[ N202C | C L _ P R O T O _RSuInMWPoLrE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ kElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:S562I:M15P:L Ewarning: ]initializer order does not match the declaration order [-Wreorder-ctor]/ NCCL_STEPS/ s562i | z e o f (tTi)d)( t{i d )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ n t| h group(groupr eads(nthreads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i275d:I90n:B lnote: oin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec k(thr e275a | d I d x . 
x )P,r igmriotuipv(egsrm,m ./b*uDfifrSeiczte=s*[/N0C,C LP_rPoRtOoT,O _0S>I MpPrLiEm]s/ N C| C ^L _STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:i595z:e5o:f (note: Tin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here) ) { 595| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupr unTreeUpDo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hw:n275<:T90,: Rnote: ein instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered Op, P r275o | t o S i m p lPerv>e(sa, 0, 2>::run' requested hereC L_MA X202_ | D E V _ A R I T YR,u n1W>o,r k/E*lDeimreenctt<=F*n/,0 ,T ,P rRoetdoO,p ,0 >A lpgroi,m sP r o| t ^o >().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:n595(:w5e:) ;note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here | ^ 595 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp :r6u:n1T:r enote: ein instantiation of member function 'RunWork, 0, 2>::run' requested hereU pDow n6< | TI,M PRLe_dCOOpL,L _PFrUoNtCo(SAilmlpRleedT>R(EaEr,g sS)I;M P L| E ^, Prod, i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t2023:253_:t )note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 : note: Rexpanded from macro 'IMPL_COLL_FUNC'u nWorkE l391e | m e nRtu,( )F.urnucn#(#wdee)v;r e d| o ^p , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cppN:C5C:L1_:A Lnote: Gin instantiation of member function 'RunWork, 0, 2>::run' 
requested hereO _##a l5g | oI,M PNLC_CCLO_LPLR_OFTUON_C#(#AplrloRteod>u(c)e.,r uTnR(E&En,c cSlISMhPmLeEm,. wPorrokd),; u\i n t| 8 ^_ t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| :^562 :15: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hfield 'nthreads' will be initialized after field 'tidInBlock': 391:95: note: 562expanded from macro 'IMPL_COLL_FUNC' | ti d391( | t i dR)u,n Wnotrhkrr,o uNpC(CgLr_oAuLpG)O,_ # #| a ^~~~~~~~~~~~~~~~~l go, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L60_:P Rnote: Ofield 'group' will be initialized after field 'stepSize'T O_## 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation 
of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous
namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :56215 | : warning: initializer order does not match the declaration order [-Wreorder-ctor] tid(tid) ,562 | n t h r etaidds((tnitdh)r,e andtsh)r,e atdisd(InntBhlroecakd(st)h,r etaiddIIdnxB.lxo)c,k (gtrhoruepa(dgIrdoxu.px)),, g| r ^~~~~~~~~~~o up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::202562::5315:: note: warning: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereinitializer order does not match the declaration order [-Wreorder-ctor] 202 | 562 | R utniWdo(rtkiEdl)e,m enntthk(()t.hrruena(dwIed)x;. 
x )| , ^ group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cppg:r8o:u1p:) ,note: in instantiation of member function 'RunWork, 0, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 8 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | IMPL_ C563O | L L _ F UsNtCe(pASlilzRee(dnucccel,S hTmReEmE.,c oSmImM.PbLuEf,f SPirzoeds,[ NiCnCtL6_4P_RtO)T O _| S^I MPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h]:/391N:C95C:L _note: Sexpanded from macro 'IMPL_COLL_FUNC'T EPS/si z391e | o f (RTu)n)W o{r k <| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c c l| F group(groupu nc##func/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 324t:y90p:e ,note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereF unc## d324e | v r e d o p t,i vNeCsCA(X)_.DrEuVn_(A&RnIcTcYl>S,h m/e*mD.iwroerckt)=;* /\0 , | P ^r oto, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h0:>562 :p15r:i mnote: sfield 'nthreads' will be initialized after field 'tidInBlock' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 595 : 5t:i dnote: (in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heret id) ,595 | n t h r eraudnsT(rnetehUrpeDaodwsn)<,T ,t iRdeIdnOBpl,o cPkr(otthorSeiamdpIldex<.1x,) ,1 >g>r(oaurpg(sg)r;o u p| ) ^, | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h202::56253::60 :note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested 
herenote: field 'group' will be initialized after field 'stepSize' 202 | 562 | t iRdu(ntWiodr)k,E lnetmhernt().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTree/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ UpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: 
note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation 
of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' : 562 | 324 : 90 :t inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( tid), nth r324e | a d s ( n t hPrreiamdist)i,v etsi562,: 60/:* Dnote: ifield 'group' will be initialized after field 'stepSize'r ect=* /5620 | , P r ottiod,( t0i>d )p,r inmtsh r e| a ^d 
s(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,595 :t5i:d Inote: nin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereB loc k595( | t h r e arduIndTxr.exe)U,p Dgorwonu>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDownwarning: >initializer order does not match the declaration order [-Wreorder-ctor]( args); | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 :t53i:d (note: tin instantiation of member function 'RunWorkElement, 0, 2>::run' requested herei d), 202n | t h r e a d s ( nRtuhnrWeoardksE)l,e mteindtI((g)r.oruupn)(,w e )| ; ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp : 7 : 1s:t enote: pin instantiation of member function 'RunWork, 0, 2>::run' requested hereS ize( n7c | cIlMSPhLm_eCmO.LcLo_mFmU.NbCu(fAflSliRzeedsu[cNeC,C LT_RPEREO,T OS_ISMIPMLPEL,E ]P/rNoCdC,L 
_uSiTnEtP3S2/_sti)z e o| f^( T)) {/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~95 : | note: group(groupexpanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h391: | 275 : 90R:u nnote: Win instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo rkr,i cNP,R O/T*OD_i#r#epcrto=t*o/>0(,) .Prruont(o&,n c0c>l Sphrmiemms. w o| r ^k ); \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^: 595:5: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here562 :15: note: field 'nthreads' will be initialized after field 'tidInBlock'595 | r562u | n T r e etUipdD(otwind<)T,, nRtehdrOepa,d sP(rnotthorSeiamdpsl)e,< 1t,i d1I>n>B(laorcgks()t;h r e| a ^d Idx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :g202r:o53u:p (note: gin instantiation of member function 'RunWorkElement, 0, 2>::run' requested herer oup) ,202 | | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562R:u60n:W onote: rfield 'group' will be initialized after field 'stepSize'k Elemen t562< | F n , Tt,i dR(etdiOdp),, Anltghor,e aPdrso(tnot>h(r)e.ardusn)(,w et)i;d I n| B ^l ock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cppe:a10d:I1d:x .note: xin instantiation of member function 'RunWork, 0, 2>::run' requested here) , g r10o | uIpM(PgLr_oCuOpL)L,_ F U| N ^~~~~~~~~~~C (AllReduce, TREE, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member 
function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mmetric<1, NCCL_MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 
'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, 
SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ c, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives<T, RedOp, FanAsymmetric<NCCL_MAX_DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1,
1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, 
SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run'
requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation 
of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx941. 27 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx1030. 27 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 
| uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice* /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | 
IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx906. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1101. 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx803. 27 warnings generated when compiling for gfx1100. 27 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 61%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
, group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: 
note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 
1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 202: | 562 : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] RunWorkElem e562n | t < F n ,t iTd,( tRiedd)O,p ,n tAhlrgeoa,d sP(rnotthor>e(a)d.sr)u,n (twied)I;n B l| o ^c k(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cppa:d6I:d1x:. 
xnote: )in instantiation of member function 'RunWork, 0, 2>::run' requested here, gr o6u | pI(MgPrLo_uCpO)L,L _ F| U ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~N C (| A tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l lRed u563c | e , T RsEtEe,p SSiIzMeP(LnEc,c lSSuhmm,e mi.ncto3m2m_.tb)u f f| S^i zes[N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:C391L:_95P:R Onote: Texpanded from macro 'IMPL_COLL_FUNC'O _SIMPLE ]391/ | N C CRLu_nSWToErPkS, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree >, NC C275L | _ A L G O _ #P#railmgiot,i vNeCsCy(m)m.erturni(c& , /*Dire/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:t562=:*15/:0 ,note: field 'nthreads' will be initialized after field 'tidInBlock'P roto, 5620 | > p r itmisd ( t| i ^d ), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r595e:a5d:s (note: nin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heret hrea d595s | ) , t irduInnTBrleoecUkp(Dtohwrne >(ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:s562):;60 : | note: ^field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 202562: | 53 : note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested heret id( t202i | d ) , n t h r eRaudnsW(onrtkhErleeamdesn)t,< Ftni,d ITn,B lRoecdkO(pt,h rAelagdoI,d xP.rxo)t,o >g(r)o.urpu(ng(rwoeu)p;) , | ^| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /* D562i | r e c t =t*i/d0(,t iPdr)o,t on,t h0r>e apdrsi(mnst h r| e ^a ds), 
ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:I595n:B5l:o cnote: kin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here( threa d595I | d x . x )r,u ngTrroeuepU(pgDroowunp<)T,, R| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d O p| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) Prot o563S | i m p l esz>e((anrcgcsl)S;h m e| m ^. comm.buff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:i202z:e53s:[ Nnote: Cin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereC L_PRO T202O | _ S I M P L E ] /RNuCnCWLo_rSkTEElPeSm/esnitz()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:u275n:(90w:e )note: ;in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 275 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp : 5 :P1r:i mnote: iin instantiation of member function 'RunWork, 0, 2>::run' requested heret ives <5T | ,I MRPeLd_OCpO,L LF_aFnUANsCy(mAmleltRreidcu ,u i/n*tD8i_rte)c t =| *^/ 0, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:t391o:,95 :0 >note: expanded from macro 'IMPL_COLL_FUNC'p rims | 391 ^ | RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:k595<:n5c:c lnote: Fin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereu nc## f595u | n c , tryupneT,r eFeuUnpcD#o#wdner,o tNoCSCiLm_pAlLeGl>g(oa,r NCCL_PROTgOs_)#;# p r| o ^t o>().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:&202n:c53c:l Snote: hin instantiation of member function 
'RunWorkElement, 0, 2>::run' requested herem em.wo r202k | ) ; \ | ^ RunWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:E562l:e15m:e nnote: tfield 'nthreads' will be initialized after field 'tidInBlock'< Fn, T ,562 | R e d O pt,i dA(ltgiod,) ,P rnotthor>e(a)d.sr(unnt(hwree)a;d s )| , ^ tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cppc:k7(:t1h:r enote: ain instantiation of member function 'RunWork, 0, 2>::run' requested hered Idx. x7) | ,I MgPrLo_uCpO(LgLr_oFuUpN)C,( A l| l ^~~~~~~~~~~~~~~~~R educ/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:,562 :T60R:E Enote: ,field 'group' will be initialized after field 'stepSize' SIMPLE ,562 | S u m , tuiidn(tt3i2d_)t,) n t| h^r eads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:n391t:h95r:e anote: dexpanded from macro 'IMPL_COLL_FUNC's ), ti d391I | n B lRoucnkW(otrhkr, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562i:n15t:6 4warning: _initializer order does not match the declaration order [-Wreorder-ctor]t ) | ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 : 95 :t inote: dexpanded from macro 'IMPL_COLL_FUNC'( tid), n t391h | r e aRdusn(Wnotrhkrp,) ,N C C| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ A L| G tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)O _##alg o563, | N C C 
Ls_tPeRpOSTiOz_e#(#npcrcoltSoh>m(e)m..rcuonm(m&.nbcucflfSShimzeems.[wNoCrCkL)_;P R\O T O| _ ^S IMPLE]/N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_15S:T Enote: Pfield 'nthreads' will be initialized after field 'tidInBlock'S /sizeo f562( | T ) ) {t i d| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| ) group(group, nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:t324h:r90e:a note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ds )324, | t i d I n BPlroicmki(ttihvreesa | , / * Dtiirde(ctti=d*)/,0 ,n tPhrroetaod,s (0n>t hprreiamdss ) ,| ^t idInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:o595c:k5(:t hnote: rin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heree adI d595x | . 
x ) , rgurnoTurpe(egUrpoDuopw)n,< T ,| ^~~~~~~~~~~R edOp, ProtoSimple<1, 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: 
note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 562 | 202 | t i d ( t i d )R,u nnWtohrrkeEaldesm(enntthd(x)..xr)u,n (gwreo)u;p ( g| r ^o up), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp :| 7 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 1 :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5637 | | I M P Ls_tCeOpLSLi_zFeU(NnCc(cAllSlhRmeedmu.cceo,m mT.RbEuEf,f SSiIzMePsL[EN,C CSLu_mP,R OuTiOn_tS3I2M_PtL)E ] /| N^C CL_ST/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:P391S:/95s:i znote: eexpanded from macro 'IMPL_COLL_FUNC'o f(T)) {391 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R u n| W group(groupo rk, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret ype, F u275n | c # # d e v rPerdiompi<,T ,N CRCeLd_OApL,G OF_a#n#Aaslygmom,e tNrCiCcL<_NPCRCOLT_OM_A#X#_pDrEoVt_oA>R(I)T.Yr,u n1(>&,n c/c*lDSihrmeecmt.=w*o/r0k,) ;P r\o t o| , ^ 0> pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:s562 : 15| : ^ note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595: 5562: | note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here ti d595( | t i d ) ,r unntThrreeeaUdpsD(onwtnhx>.(xa)r,g sg)r;o u p| ( ^g roup), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 202 ^~~~~~~~~~~~~~~~~: 53: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here562 :60: note: 202field 'group' will be initialized after field 'stepSize' | 562 | R u n Wtoirdk(Etliedm)e,n tno(c)k.(rtuhnr(ewaed)I;d x .| x ^) , gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cppu:p7(:g1r:o unote: pin instantiation of member function 'RunWork, 0, 2>::run' requested here) , | ^~~~~~~~~~~7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, 
SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' 
requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: 
note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: 
note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:i562m:s15 : | warning: ^initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595 :5625 | : note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here tid( t595i | d ) , nrtuhnrTeraedesU(pnDtohwrnex>)(,a rggrso)u;p ( g| r ^o up), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 202 :| 53 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 563 | 202 | 
s t e p S i z eR(unncWcolrSkhEmleemm.ecnotm().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:)562): 15{: warning: | initializer order does not match the declaration order [-Wreorder-ctor] ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:t324i:d90):, note: nin instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hreads (324n | t h r e a d sP)r,i mtiitdiIvneBsl , / *sDtierpeScitz=e*(/n0c,c lPSrhomteom,. 
c0o>m mp.rbiumfsf S i| z ^e s[NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:P595R:O5T:O _note: Sin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested hereI MPLE ]595/ | N C C L _rSuTnETPrSe/esUipzDeoowfn( >note: (in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea rgs) ;275 | | ^ Pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:i202t:i53v:e snote: , 0, 2>::run' requested hereT , Re d202O | p , F a n A s yRmumneWtorrikcEl,g o/,* DPirroetcot>=(*)/.0r,u nP(rwoet)o;, 0| > ^ prims /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp| : ^10 :1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: :in instantiation of member function 'RunWork, 0, 2>::run' requested here595 :5: 10note: | in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereI MPL _595C | O L L _ FrUuNnCT(rAelelURpeDdouwcne<,T ,T RREeEd,O pS,I MPPrLoEt,o SSiummp,l eh >| (^a rgs)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h;: 391 :| 95 ^: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :39153 | : note: Rin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereu nWo r202k | < n c c l F u n cR#u#nfWuonrck,E lteympeen,t ,P rNoCtCoL>_(A)L.GrOu_n#(#wael)g;o , | N ^C CL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cppR:O11T:O1_:# #note: pin instantiation of member function 'RunWork, 0, 2>::run' requested herer oto >11( | )I.MrPuLn_(C&OnLcLc_lFSUhNmCe(mA.lwloRrekd)u;c e\, T| R ^E E, 
SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:E15,: Snote: ufield 'nthreads' will be initialized after field 'tidInBlock'm , floa t562) | | ^ tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i391d:)95,: nnote: texpanded from macro 'IMPL_COLL_FUNC'h reads( n391t | h r eRaudnsW)o,r kt, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:A60L:G Onote: _field 'group' will be initialized after field 'stepSize'# #algo ,562 | N C C L _tPiRdO(TtOi_d#)#,p rnotthor>e(a)d.sr(unnt(h&rnecacdlsS)h,m etmi.dwIonrBkl)o;c k\( t h| r ^e adIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,15 :g rnote: ofield 'nthreads' will be initialized after field 'tidInBlock'u p(gro u562p | ) , | t ^~~~~~~~~~~i d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: 
note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUN C202( | A l l R e d u c eR,u nTWRoErEk,E lSeImMePnLtE<,F nS,u mT,, fRleodaOtp), A| l^g o, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391t:o95>:( )note: .expanded from macro 'IMPL_COLL_FUNC'r un(we) ;391 | | ^R unWork, 0, 2>::run' requested here, ty p10e | ,I MFPuLn_cC#O#LdLe_vFrUeNdCo(pAu,c eN,C CTLR_EAEL,G OS_I#M#PaLlEg,o ,S uNmC,C Lh_aPlRfO)T O _| #^# proto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h>:(391):.95r:u nnote: (expanded from macro 'IMPL_COLL_FUNC'& ncclS h391m | e m .RwuonrWko)r;k <\n c c| l ^F 
unc##f/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562c:,15 :t ynote: pfield 'nthreads' will be initialized after field 'tidInBlock'e , Func #562# | d e v r etdiodp( ,n tNhCrCeLa_dAsL(GnOt_h#r#eaaldgso),, NtCiCdLI_nPBRlOoTcOk_(#t#hprreoatdoI>d(x)..xr)u,n (g&rnocucpl(Sghrmoeump.)w,o r k| ) ^~~~~~~~~~~~~~~~~; \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^: 60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 15562: | note: field 'nthreads' will be initialized after field 'tidInBlock' tid(t i562d | ) , n tthirde(atdisd()n,t hnrtehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~( group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, 
/*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> 
p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: 
variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx941. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1100. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx803. 27 warnings generated when compiling for gfx1030. 27 warnings generated when compiling for gfx900. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 
--offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h;: 562\: 15 :| ^warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :56215 | : note: field 'nthreads' will be initialized after field 'tidInBlock' tid(t i562d | ) , n tthirde(atdisd()n,t hnrtehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( g r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u p), | 563 ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562s:t60e:p Snote: ifield 'group' will be initialized after field 'stepSize'z e(nc c562l | S h m e mt.icdo(mtmi.db)u,f fnStihzreesa[dNsC(CnLt_hPrReOaTdOs_)S,I MtPiLdEI]n/BNlCoCcLk_(StThErPeSa/dsIidzxe.oxf)(,T )g)r o{u p (| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group) , | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 
'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 
324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in 
instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562):,15 :n twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eads(nthr e562a | d s ) , ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u pt(igdrIonuBpl)o,c k (| t ^~~~~~~~~~~~~~~~~h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:d562x:.60x:) ,note: field 'group' will be initialized after field 'stepSize'g roup( g562r | o u p ) ,t i d| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), nthre a563d | s ( n t hsrteeapdSsi)z,e (tnicdcIlnSBhlmoecmk.(ctohmm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d swarning: (initializer order does not match the declaration order [-Wreorder-ctor]n threads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o c k| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t hread I563d | x . x ) ,s tgerpoSuipz(eg(rnocucpl)S,h m e| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~. c o| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)m .buff S563i | z e s [ NsCtCeLp_SPiRzOeT(On_cScIlMSPhLmEe]m/.NcCoCmLm_.SbTuEfPfSS/isziezse[oNfC(CTL)_)P R{O T O| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S I M| P group(groupL E]/NCCL_STEPS/sizeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hf:(275T:)90): {note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 275 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :P324r:i90m:i tnote: iin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herev es1,, /N*CDCiLr_eMcAtX=_*D/E0V,_ APRrIoTtYo>,, 0/>* Dpirriemcst = *| / ^0 , Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:t595o:,5 :0 >note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herep rims 595 | | ^ ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:T595r:e5e:U pnote: Din instantiation of function template specialization 
'(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereo wn< T595, | R e d Orpu,n TPrreoetUopSDiomwpnlO>p(,a rPgrso)t;o S i| m ^p le<1, 1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h>:>202(:a53r:g snote: )in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here; | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 :R unote: nin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereW orkE l202e | m e n t < F n , RTu,n WRoerdkOEpl,e mAelngto<,F nP,r oTt,o >R(e)d.Orpu,n (Awleg)o;, P| r ^o to>()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.:r6u:n1(:w enote: )in instantiation of member function 'RunWork, 0, 2>::run' requested here; | ^6 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp_:C7O:L1L:_ Fnote: Uin instantiation of member function 'RunWork, 0, 2>::run' requested hereN C(A l7l | RIeMdPuLc_eC,O LTLR_EFEU,N CS(IAMlPlLREe,d uMcaex,, TiRnEtE3,2 _StI)M P L| E^, Max,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :u391i:n95t:3 2note: _expanded from macro 'IMPL_COLL_FUNC't ) | ^ 391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 :R95u:n Wnote: oexpanded from macro 'IMPL_COLL_FUNC'r kc,# #NdCeCvLr_eAdLoGpO<_t#y#pael>g,o ,N CNCCLC_LA_LPGROO_T#O#_a#l#gpor,o tNoC>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 275 | P562r | i m i t itvieds(.,x )/,* Dgirroeucpt(=g*r/o0u,p )P,r o t| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, 0| > tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) prims 563 | | ^ st/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:p595S:i5z:e (note: nin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herec clSh m595e | m . c o mrmu.nbTurfefeSUipzDeosw[nNS>/(sairzgeso)f;( T )| ) ^ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 202 group(group: 53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 324202: | 90 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Run W324o | r k E l e m ePnrtie(t)r.ircunote: ,in instantiation of member function 'RunWork, 0, 2>::run' requested here /*D i7r | eIcMtP=L*_/C0O,L LP_rFoUtNoC,( A0l>l Rperdiumcse , | T ^R EE, S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hI:M595P:L5E:, note: Min instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herea x, u i595n | t 3 2 _ tr)u n T| r^e 
eUpDo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hw:n391<:T95,: Rnote: eexpanded from macro 'IMPL_COLL_FUNC'd Op, Pr o391t | o S iRmupnlWeoc>l(Faurngcs#)#;f u n| c ^, type,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :F202u:n53c:# #note: din instantiation of member function 'RunWorkElement, 0, 2>::run' requested heree vred o202p | < t y p e > , NRCuCnLW_oArLkGEOl_e#m#eanltgr(o)t.or>u(n)(.&rnucnc(lwSeh)m;e m .| w ^o rk); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp\: 5 :| 1 ^: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 155: | Inote: Mfield 'nthreads' will be initialized after field 'tidInBlock'P L_COL L562_ | F U N C (tAildl(Rteiddu)c,e ,n tThRrEeEa,d sS(InMtPhLrEe,a dMsa)x,, tuiidnItn8B_lto)c k (| t^h rea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:I391d:x95.:x )note: ,expanded from macro 'IMPL_COLL_FUNC' group(g r391o | u p )R,u n W| o ^~~~~~~~~~~~~~~~~r kt,h rNeCaCdLs_)A,L GtOi_d#I#naBllgooc,k (NtChCrLe_aPdRIOdTxO._x#)#,p rgortoou>p(()g.rrouunp()&,n c c| l ^~~~~~~~~~~S hmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hCCL_PR:O562T:O15_:# #warning: prinitializer order does not match the declaration order [-Wreorder-ctor]o to>().run(&nccl S562h | m e m . wtoirdk()t;i d\) , | n ^t hreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r15e:a dnote: sfield 'nthreads' will be initialized after field 'tidInBlock') , tid I562n | B l o c kt(itdh(rteiadd)I,d xn.txh)r,e agdrso(unpt(hgrreoaudps)),, t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d I n| B tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l ock(t h563r | e a d I dsxt.exp)S,i zger(onucpc(lgSrhomuepm).,c o m| m ^~~~~~~~~~~~~~~~~. 
bu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:f562S:i60z:e snote: [field 'group' will be initialized after field 'stepSize'N CCL_ P562R | O T O _ StIiMdP(LtEi]d/)N,C CnLt_hSrTeEaPdSs/(snitzheroefa(dTs))), {t i d| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n B l| o group(groupc k(threadIdx.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hg:r324o:u90p:( gnote: rin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo up), 324| | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[dx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, 
SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:T562):)15 :{ warning: initializer order does not match the declaration order [-Wreorder-ctor]| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h275r:e90a:d snote: (in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren thread s275) | , t i d I nPBrliomcikt(itvhersep,S i/z*eD(inrceccltS=h*m/e0m,. cPormomt.ob,u f0f>S ipzreism[sN C C| L ^_ PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hO:_595S:I5M:P Lnote: Ein instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here] /NCC L595_ | S T E P Sr/usniTzreeoefU(pTD)o)w n{< T ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R e d| O group(groupp , ProtoS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:m275p:l90e:< 1note: ,in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 1>>(ar g275s | ) ; | ^ Primiti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hv:e202s:<53T:, note: Rin instantiation of member function 'RunWorkElement, 0, 2>::run' requested heree dOp, 202F | a n A s y m m e tRruincWp,, /A*lDgior,e cPtr=o*t/o0>,( )P.rroutno(,w e0)>; p r| i ^m s | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::1595:: 5note: :in instantiation of member function 'RunWork, 0, 2>::run' requested here note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 8 | I595M | P L _ C OrLuLn_TFrUeNeCU(pADlolwRnet>6(4a_rtg)s ) ;| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h95::202 :note: 53expanded from macro 'IMPL_COLL_FUNC': note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 391 | 202 | R u n W o r k e(>),. rNuCnC(Lw_eA)LGO_##algo;, N| C ^C L_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cppO:_9#:#1p:r onote: tin instantiation of member function 'RunWork, 0, 2>::run' requested hereo >(). r9u | nI(M&PnLc_cClOSLhLm_eFmU.NwCo(rAkl)l;R e\d u c| e ^, TREE,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :S562I:M15P:L Enote: ,field 'nthreads' will be initialized after field 'tidInBlock' Max, u562i | n t 6 4 _tti)d ( t| i^d ), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t391h:r95e:a dnote: sexpanded from macro 'IMPL_COLL_FUNC'( nthrea d391s | ) , RtuindWIonrBkl60,: Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_AL G562O | _ # # a ltgiod,( tNiCdC)L,_ PnRtOhTrOe_a#d#sp(rnotthor>e(a)d.sr)u,n (t&indcIcnlBSlhomcekm(.twhorreka)d;I d\x . 
x| ) ^, gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:(15g:r onote: ufield 'nthreads' will be initialized after field 'tidInBlock'p ), | ^~~~~~~~~~~562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested 
here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of 
member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' 
requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 1>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 
0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 
0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, 
SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, 
SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562(:A15l:l Rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]d uce, TRE E562, | S I M PtLiEd,( tMiadx),, fnltoharte)a d s| (^n threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s391):,95 :t inote: dexpanded from macro 'IMPL_COLL_FUNC'I nBlock (391t | h r 
eRaudnIWdoxr.kx<)n,c cglrFouunpc(#g#rfouunpc),, t y| p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e , | F tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u nc## d563e | v r e d osptp,S iNzCeC(Ln_cAcLlGSOh_m#e#ma.lcgoom,m .NbCuCfLf_SPiRzOeTsO[_N#C#CpLr_oPtRoO>T(O)_.SrIuMnP(L&En]c/cNlCSChLm_eSmT.EwPoSr/ks)i;z e\o f (| T ^) ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : group(group15 : note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90 :562 | note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ti d275( | t i d ) , nPtrhirmeiatdisv(enst,, /| * ^~~~~~~~~~~~~~~~~D ire/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:t562=:*60/:0 ,note: field 'group' will be initialized after field 'stepSize'P roto, 5620 | > p r itmisd ( t| i ^d ), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r595e:a5d:s (note: nin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heret hrea d595s | ) , t irduInnTBrleoecUkp(Dtohwrne >(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | 
IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^: 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here562 | 
10t | iIdM(PtLi_dC)O,L Ln_tFhUrNeCa(dAsl(lnRtehdruecaed,s )T,R EtEi,d ISnIBMlPoLcEk,( tMharxe,a dhIadlxf.)x ) ,| ^g roup(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u391p:)95,: note: | expanded from macro 'IMPL_COLL_FUNC' ^~~~~~~~~~~~~~~~~ 391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 :R60u:n Wnote: ofield 'group' will be initialized after field 'stepSize'r kI,n BNlCoCcLk_(AtLhGrOe_a#d#Iadlxg.ox,) ,N CgCrLo_uPpR(OgTrOo_u#p#)p,r o t| o ^~~~~~~~~~~> ().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: 
note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' 
requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 15t:i dwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]n Block(thr e562a | d I d x .txi)d,( tgirdo)u,p (ngtrhoruepa)d,s ( n| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d s), t563i | d I n B lsotcekp(Stihzree(andcIcdlxS.hxm)e,m .gcroomump.(bgurfofuSpizes[NCCL)_,P R O| T ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~O _ S| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)M PLE]/ N563C | C L _ S TsEtPeSp/Ssiizzee(onfc(cTl)S)h m{e m .| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o m m| . 
group(groupb uffSizes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hN:C324C:L90_:P Rnote: Oin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT O_SIM P324L | E ] / N C C LP_rSiTmEiPtSi/vseisz, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereX _DEV_A R324I | T Y > , / *PDriirmeictti=v*e/s0<,T ,P rRoetdoO,p ,0 >F apnrAismysm m e| t ^r ic<1,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :N595C:C5L:_ Mnote: Ain instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereX _DE V595_ | A R I T Yr>u,n T/r*eDeiUrpeDcotw=n*o tporSiimmsp l e| < ^1 , 1>>(ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hg:s595):;5 : | note: ^in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h595: | 202 : 53 : rnote: uin instantiation of member function 'RunWorkElement, 0, 2>::run' requested heren Tre e202U | p D o w n < T , RRuendWOopr,k EPlreomteonStid>O(pa,r gAsl)g;o , | P ^r oto>().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:n202(:w53e:) ;note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp : 13 : 1 : note: in instantiation of member function 'RunWork, 0, 2>::run' requested hereR unW o13r | kIEMlPeLm_eCnOtLM(P)L.Er,u nM(awxe,) ;r c c| l ^_ bfloat1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp6:)9 : 1| :^ note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391 :995 | :I Mnote: Pexpanded from macro 'IMPL_COLL_FUNC'L _COLL_ F391U | N C (RAulnlWRoerdku391,: 95N:C Cnote: Lexpanded from macro 'IMPL_COLL_FUNC'_ ALGO_# #391a | l g oR,u nNWCoCrLk_,( )t.yrpuen,( &Fnucnccl#S#hdmeevmr.ewdoorpk<)t;y p\e > ,| ^N CCL_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:G562O:_15#:# anote: lfield 'nthreads' will be initialized after field 'tidInBlock'g o, NC C562L | _ P R O TtOi_d#(#tpirdo)t,o >n(t)h.rreuand(s&(nnctchlrSehamdesm).,w otrikd)I;n B\l o c| k ^( thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:d562x:.15x:) ,note: field 'nthreads' will be initialized after field 'tidInBlock'g roup( g562r | o u p ) ,t i d| ( ^~~~~~~~~~~~~~~~~t id)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:t60h:r enote: afield 'group' will be initialized after field 'stepSize'd s(nth r562e | a d s ) ,t itdi(dtIindB)l,o cnkt(htrheraedasd(Indtxh.rxe)a,d sg)r,o utpi(dgIrnoBulpo)c,k ( t| h ^~~~~~~~~~~~~~~~~r eadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:x562.:x60):, note: gfield 'group' will be initialized after field 'stepSize'r oup(g r562o | u p ) , t i| d ^~~~~~~~~~~( tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562):;15 :\ warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562562: | 15 : note: field 'nthreads' will be initialized after field 'tidInBlock't id(ti d562) | , n t htrieda(dtsi(dn)t,h rnetahdrse)a,d st(indtIhnrBelaodcsk)(,t htriedaIdnIBdlxo.cxk)(,t hgrreoaudpI(dgxr.oxu)p,) ,g r o| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p ( g| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o up), 563| | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:t562e:p60S:i znote: efield 'group' will be initialized after field 'stepSize'( ncclSh m562e | m . 
c o mtmi.db(utfifdS)i,z enst[hNrCeCaLd_sP(RnOtThOr_eSaIdMsP)L,E ]t/iNdCICnLB_lSoTcEkP(St/hsriezaedoIfd(xT.)x)) ,{ g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p (| g group(groupr oup), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^~~~~~~~~~~: 275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx941. 27 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in 
instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx90a. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
27 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:7:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, 
Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx906. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx1100. 27 warnings generated when compiling for gfx1030. 27 warnings generated when compiling for gfx803. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 
-mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested 
here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member 
function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h group(group: 562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i275d:(90t:i dnote: )in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, nthre a275d | s ( n t h r ePardism)i,t itviedsI, 563/ | * D i r esctte=p*S/i0z,e (PnrcoctloS,h m0e>m .pcroimmms. b u| f ^f Sizes[NCCL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hO:T595O:_5S:I Mnote: Pin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereL E]/N C595C | L _ S T ErPuSn/TsriezeeUopfD(oTw)n)< T{, R| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d O p| , group(group ProtoSimp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:e275<:190,: 1note: >in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here> (args )275; | | ^ Primiti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hv:e202s:<53T:, note: Rin instantiation of member function 'RunWorkElement, 0, 2>::run' requested heree dOp, 202F | a n A s y m m e tRruincWp,, /A*lDgior,e cPtr=o*t/o0>,( )P.rroutno(,w e0)>; p r| i ^m s | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:: 595note: :in instantiation of member function 'RunWork, 0, 2>::run' requested here5 : note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here4 | IMP L595_ | C O L L _rFuUnNTC(AllRerdeuecUep,D oTwRnE >(arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:)391;: 95 :| ^note: expanded from macro 'IMPL_COLL_FUNC' 
391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 202 :R53u:n Wnote: oin instantiation of member function 'RunWorkElement, 0, 2>::run' requested herer k ,A lNgCoC,L _PArLoGtOo_>#(#)a.lrguon,( wNeC)C;L _ P| R ^O TO_##prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppo:>5(:)1.:r unote: nin instantiation of member function 'RunWork, 0, 2>::run' requested here( &ncc l5S | hImMePmL._wCoOrLkL)_;F U\N C (| A ^l lReduc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:,562 :T15R:E Enote: ,field 'nthreads' will be initialized after field 'tidInBlock' SIMP L562E | , M i nt,i du(itnitd8)_,t )n t h| r^e ads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:s95):, note: texpanded from macro 'IMPL_COLL_FUNC'i dInBlock (391t | h r eRaudnIWdoxr.kx<)n,c cglrFouunpc(#g#rfouunpc),, t y| p ^~~~~~~~~~~~~~~~~e , F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562c:#60#:d enote: vfield 'group' will be initialized after field 'stepSize'r edop< t562y | p e > , tNiCdC(Lt_iAdL)G,O _n#t#harlegaod,s (NnCtChLr_ePaRdOsT)O,_ #t#ipdrIontBol>o(c)k.(rtuhnr(e&andcIcdlxS.hxm)e,m .gwroorukp)(;g r\o u p| ) ^, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp :s1t: eIn file included from p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hS:i10z: eIn file included from (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hn:c167c: l/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:h562m:e15m:. cwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]m m.buffSiz e562s | [ N C C Lt_iPdR(OtTiOd_)S,I MnPtLhEr]e/aNdCsC(Ln_tShTrEePaSd/ss)i,z etoifd(ITn)B)l o{c k (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd Idx.x), group(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:o324u:p90):, note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 324 | 563 | P rsitmeiptSiivzees(C,L _/S*TDEiPrSe/csti=z*e/o0f,( TP)r)o t{o , | 0 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~> p| r group(groupi ms | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::595275::590:: note: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herein instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 595 | 275 | r u n TPrreiemUiptDiovwens<C>L(_aMrAgXs_)D;E V _| A ^R ITY, 1>,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :/202*:D53i:r enote: cin instantiation of member function 'RunWorkElement, 0, 2>::run' requested heret =*/0 ,202 | P r o t o , 0 >R upnrWiomrsk E l| e ^m ent, ProtoSimple<1, 1>>' requested hereA lgo ,595 | P r o t or>u(n)T.rreuenU(pwDeo)w;n < T| , ^ RedOp, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cppr:o4t:o1S:i mnote: pin instantiation of member function 'RunWork, 0, 2>::run' requested herel e<1, 41 | >I>M(PaLr_gCsO)L;L _ F| U ^N C(AllRe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:u202c:e53,: Tnote: Rin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereE E, S202I | M P L E , M i nR,u niWnotr8k_Etl)e m e| n^t R(u)n.Wrournk(, 0, 2>::run' requested here Fun c4# | #IdMePvLr_eCdOoLpL<_tFyUpNeC>(,A lNlCRCeLd_uAcLeG,O _T#R#EEa,l gSoI,M PNLCEC,L _MPiRnO,T Oi_n#t#8p_rto)t o >| (^) .run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:&391n:c95c:l Snote: hexpanded from macro 'IMPL_COLL_FUNC'm em.wo r391k | ) ; R\u n W| o ^r ke,a dNsC(CnLt_hArLeGaOd_s#)#,a ltgiod,I nNBClCoLc_kP(RtOhTrOe_a#d#Ipdrxo.txo)>,( )g.rrouunp((&gnrcoculpS)h,m e m| . 
^~~~~~~~~~~~~~~~~w ork)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h;: 562\: 60 :| ^note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 15562: | note: field 'nthreads' will be initialized after field 'tidInBlock' tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhrreeaaddss)(,n tthirdeIandBsl)o,c kt(itdhIrneBaldoIcdkx(.txh)r,e agdrIoduxp.(xg)r,o ugpr)o,u p (| g ^~~~~~~~~~~r oup), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 
0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 
in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, 
Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: 
in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pe, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' 
requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, 
SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1560:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]field 'group' will be initialized after field 'stepSize' 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| 
^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h562::32415::90 :warning: initializer order does not match the declaration order [-Wreorder-ctor]note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324562 | | t iPdr(itmiidt)i,v enstu,p (/g*rDoiurpe)c,t = *| / ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~0 , | P tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oto, 0563> | p r i msst e p| S ^i ze(nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:l595S:h5m:e mnote: .in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested herec omm. 
b595u | f f S i zreusn[TNrCeCeLU_pPDRoOwTnO<_TS,I MRPeLdEO]p/,N CPCrLo_tSoTSEiPmSp/lseiT>)()a { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rgs); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, 
FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: 
in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562AL:G15O:_ #warning: #initializer order does not match the declaration order [-Wreorder-ctor]a lgo, NCCL_P R562O | T O _ # #tpirdo(ttoi>d()),. 
rnutnh(r&enacdcsl(Snhtmherme.awdosr)k,) ;t i\d I n| B ^l ock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d Inote: dfield 'nthreads' will be initialized after field 'tidInBlock'x .x), g562r | o u p ( gtriodu(pt)i,d ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ads(n t563h | r e a d ss)t,e ptSiidzIen(BnlcocclkS(htmherme.acdoImdmx..bxu)f,f Sgirzoeusp[(NgCrCoLu_PpR)O,T O _| S ^~~~~~~~~~~~~~~~~I MPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h]:/562N:C60C:L _note: Sfield 'group' will be initialized after field 'stepSize'T EPS/s i562z | e o f ( Tt)i)d ({t i d| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, n| t group(grouph reads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 324t:i90d:I nnote: Bin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel ock(thr e324a | d I d x . 
x )P,r igmriotuipv(egsr, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: 
in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | 
runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx941. 27 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 27 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx803. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, 
SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
27 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 27 warnings generated when compiling for gfx1100. 27 warnings generated when compiling for gfx906. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run<1>, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx803. 
13 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 13 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 13 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ :21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 73%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 62%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, 
TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, 
PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, 
Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 
'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) 
{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here: 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 595 | r u562n | T r e e UtpiDdo(wtniI>n(Balrogcsk)(;t h r| e ^a dIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :g202r:o53u:p (note: gin instantiation of member function 'RunWorkElement, 0, 2>::run' requested herer oup), 202 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) Ru n563W | o r k E lsetmeepnStis([)N.CrCuLn_(PwReO)T;O _ S| I ^M PLE]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp_:S5T:E1P:S /note: sin instantiation of member function 'RunWork, 0, 2>::run' requested herei zeof( T5) | )I M{P L _| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O L L| _ group(groupF UNC(AllR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:d324u:c90e:, note: Tin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested hereR EE, S324I | M P L E , PPrreiMmuiltSiuvme,s k,< n/c*cDliFruencct#=#*f/u0n,c ,P rtoytpoe,, 0F>u npcr#i#mdse v r| e ^d op595,: 5N:C Cnote: Lin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here_ ALG O595_ | # # a l grou,n TNrCeCeLU_pPDRoOwTnO<_T#,# pRreodtOop>,( )P.rroutno(S&inmcpclleSw>o(rakr)g;s )\; | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562202::1553:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 562 | t i dR(utniWdo)r,k Enltehmreenatdr(e)a.drIudnx(.wxe)),; g r| o ^u p(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp):,5 : 1| : ^~~~~~~~~~~~~~~~~ note: in instantiation of member function 'RunWork, 0, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :560 | :I Mnote: Pfield 'group' will be initialized after field 'stepSize'L _COLL _562F | U N C ( AtlildR(etdiudc)e,, nTtRhErEe,a dSsI(MnPtLhEr,e aPdrse)M,u ltSiudmI,n Bulionctk8(_tth)r e a| d^I dx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hx:)391,: 95g:r onote: uexpanded from macro 'IMPL_COLL_FUNC'p (group) ,391 | | ^~~~~~~~~~~R unWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested 
here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | 
Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^: 562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hwarning: :initializer order does not match the declaration order [-Wreorder-ctor]562 :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i d (ttiidd()t,i dn)t,h rnetahdrse(andtsh(rnetahdrse)a,d st)i,d ItniBdlIoncBkl(otchkr(etahdrIedaxd.Ixd)x,. 
xg)r,o ugpr(ogurpo(ugpr)o,u p )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^~~~~~~~~~~~~~~~~ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :56360 | : note: field 'group' will be initialized after field 'stepSize' stepS i562z | e ( n c ctliSdh(mteimd.)c,o mnmt.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation 
of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::4562::115:: note: warning: in instantiation of member function 'RunWork, 0, 2>::run' requested hereinitializer order does not match the declaration order [-Wreorder-ctor] 4 | 562I | M P L _ CtOiLdL(_tFiUdN)C,( AnltlhRreedaudcse(,n tThRrEeEa,d sS)I,M PtLiEd,I nPBrleoMcukl(Stuhmr,e aidnItd8x_.tx)) , | g^r oup(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u391p:)95,: note: | expanded from macro 'IMPL_COLL_FUNC' ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 391 | R563u | n W o r ksO,_ SNICMCPLL_EA]L/GNOC_C#L#_aSlTgEoP,S /NsCiCzLe_oPfR(OTT)O)_ #{# p r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t o >| ( group(group) .run(&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:c275l:S90h:m enote: min instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here. 
work) ;275 | \ | ^ Primit/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:v562e:s15<:T ,note: field 'nthreads' will be initialized after field 'tidInBlock'R edOp, 562F | a n A s ytmimde(ttriidc)<,N CnCtLh_rMeAaXd_sD(EnVt_hArReIaTdYs,) ,1 >t,i d/I*nDBilroecckt(=t*h/r0e,a dPIrdoxt.ox,) ,0 >g rporuipm(sg r o| u ^p ), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ^~~~~~~~~~~~~~~~~595 :5:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here: 60: note: field 'group' will be initialized after field 'stepSize'595 | 562r | u n T r eteiUdp(Dtoiwdn)<,T ,n tRherdeOapd,s (PnrtohtroeSaidmsp)l,e B>l(oacrkg(st)h;r e a| d ^I dx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202g:r53o:u pnote: (in instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereg roup )202, | | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member 
function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, 
int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) {
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads),
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), Shmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:d562x:.15x:) ,warning: group(initializer order does not match the declaration order [-Wreorder-ctor]g roup), | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60t:i dnote: (field 'group' will be initialized after field 'stepSize't id), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g r o| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p (grou p563) | , | ^~~~~~~~~~~s tepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15t:i dwarning: (initializer order does not match the declaration order [-Wreorder-ctor]t id), nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~~~~~~~) , g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p60(:g rnote: ofield 'group' will be initialized after field 'stepSize'u p), 562| | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t id(ti d563) | , n t hsrteeapdSsi(znet(hnrcecaldSsh)m,e mt.icdoImnmB.lboucfkf(Stihzreesa[dNICdCxL._xP)R,O TgOr_oSuIpM(PgLrEo]u/pN)C,C L _| S ^~~~~~~~~~~T EPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: 
note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mple<1, 1>>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnBloc:k562(:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d Idx.x), group(group), 562 | | ^~~~~~~~~~~~~~~~~ t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t60i:d )note: ,field 'group' will be initialized after field 'stepSize' nthre a562d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodcsk((ntthhrreeaaddIsd)x,. xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~. 
x )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) group (563g | r o u p )s,t e p| S ^~~~~~~~~~~i ze(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:(562w:e15):; warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp :5626 | : 1 : note: tin instantiation of member function 'RunWork, 0, 2>::run' requested herei d(t i6d | )I,M PnLt_hCrOeLaLd_sF(UnNtCh(rAelaldRse)d,u ctei,d ITnRBElEo,c kS(ItMhPrLeEa,d IPdrxe.Mxu)l,S ugmr,o uipn(tg3r2o_utp)) , | ^| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: 563note: | expanded from macro 'IMPL_COLL_FUNC' ste p391S | i z eR(unncWcolrSkh_,S TNECPCSL/_sAiLzGeOo_f#(#Ta)l)g o{, 
N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C L _| P group(groupR OTO_##pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:o324>:(90):. rnote: uin instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren (&ncc l324S | h m e m . w oPrrki)m;i t\i v e| s ^< T, RedO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:,562 :F15a:n Anote: sfield 'nthreads' will be initialized after field 'tidInBlock'y mmetr i562c | < 1 , NtCiCdL(_tMiAdX)_,D EnVt_hArReIaTdYs>(,n t/h*rDeiardesc)t,= *t/i0d,I nPBrlootcok,( t0h>r epardiImdsx . x| ) ^, group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hg:r595o:u5p:) ,note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here | ^~~~~~~~~~~~~~~~~ 595 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60r:u nnote: Tfield 'group' will be initialized after field 'stepSize'r eeUp D562o | w n < T ,t iRde(dtOipd,) ,P rnotthorSeiamdpsl(end>s()a,r gtsi)d;I n B| l ^o ck(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d202I:d53x:. 
xnote: )in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here, gro u202p | ( g r o u p ) , R u| n ^~~~~~~~~~~W orkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 
275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::53562:: 15note: :in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here warning: initializer order does not match the declaration order [-Wreorder-ctor] 202 | 562 | R u ntWiodr(ktEilde)m,e nnttc(k)(.trhurne(awdeI)d;x . x| ) ^, group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppo:u7p:)1,: note: | in instantiation of member function 'RunWork, 0, 2>::run' requested here ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 7 | I563M | P L _ C OsLtLe_pFSUiNzCe((AnlclcRleSdhumceem,. 
cToRmEmE.,b uSfIfMSPiLzEe,s [PNrCeCMLu_lPSRuOmT,O _uSiInMtP3L2E_]t/)N C C| L^_ STEP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:/391s:i95z:e onote: fexpanded from macro 'IMPL_COLL_FUNC'( T)) { 391| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ R| u group(groupn Work, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here c, t275y | p e , F u nPcr#i#mdietvirveedsoO,p ,N CFCaLn_AAsLyGmOm_e#t#railcg>(,) ./r*uDni(r&encctc=l*S/h0m,e mP.rwootrok,) ;0 >\ p r| i ^m s | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:: 595note: :field 'nthreads' will be initialized after field 'tidInBlock'5 : note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 562 | 595 | t i d (rtuindT)r,e enUtphDroewande>a(daIrdgxs.)x;) , | g ^r oup(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u202p:)53,: note: | in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h202: | 562 : 60 : note: field 'group' will be initialized after field 'stepSize' RunW o562r | k E l e mteindt(t(i)d.IrnuBnl(owcek)(;t h r| e ^a dIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp,: 10g:r1o:u pnote: (in instantiation of member function 'RunWork, 0, 2>::run' requested hereg roup )10, | I M| P ^~~~~~~~~~~L _COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:d562x:.15x:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor]g roup(grou p562) | , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t id), n563t | h r e a dsst(enptShirzeea(dnsc)c,l SthimdeImn.Bcloomcmk.(btuhfrfeSaidzIedsx[.NxC)C,L _gPrRoOuTpO(_gSrIoMuPpL)E,] / N| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C L _| S tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T EPS/s i563z | e o f ( Ts)t)e p{S i z| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( n c| c group(groupl Shmem.comm.b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:f275f:S90i:z enote: sin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here[ NCCL_ P275R | O T O _ S I MPPrLiEm]i/tNiCvCeLs_, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 1>, / *275D | i r e c t = *P/r0i,m iPtriovteos,< T0,> RperdiOmps, F| a ^n Asym/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hm:e595t:r5i:c , ProtoSimple<1, 1>>' requested hereC CL_MAX_DEV_ARITY, 1>, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ of(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitivesh,r e/a*dDsi(rnetchtr=e*a/d0s,) ,P rtoitdoI,n B0l>o cpkr(itmhsr e a| d ^I dx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,595 :g5r:o unote: pin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here( gro u595p | ) , | r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u n T| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e eUpDo w563n | < T , RsetdeOppS,i zPer(ontcocSliSmhpmleem<.1c,o m1m>.>b(uafrfgSsi)z;e s [| N ^C CL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:T202O:_53S:I Mnote: Pin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereL E]/ N202C | C L _ S T E P S /RsuinzWeoorfk(ETl)e)m e{n t <| F ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n , | T group(group, RedOp, Algo, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:t275o:>90(:) .note: rin instantiation of member function 'Primitives, 
FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu n(we); | ^ 275 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp :P9r:i1m:i tnote: iin instantiation of member function 'RunWork, 0, 2>::run' requested herev es< T9, | IRMePdLO_pC,O LFLa_nFAUsNyCm(mAeltlrRiecdM,u l/S*uDmi,r eucitn=t*6/40_,t )P r o| t^o , 0> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:r391i:m95s: note: | expanded from macro 'IMPL_COLL_FUNC' ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h391: | 595 : 5R:u nnote: Win instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereo rk <595n | c c l F urnucn#T#rfeuenUcp,D otwynp1,, N1C>C>L(_aArLgGsO)_;# # a| l ^g o, NCCL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:T202O:_53#:# pnote: rin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereo to>() .202r | u n ( & n c c l SRhumneWmo.rwkoErlke)m;e n\t < F| n ^, T, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:d562O:p15,: Anote: lfield 'nthreads' will be initialized after field 'tidInBlock'g o, Pr o562t | o > ( ) .triudn((twied));, n| t ^h reads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppt:h6r:e1a:d snote: )in instantiation of member function 'RunWork, 0, 2>::run' requested here, tid I6n | BIlMoPcLk_(CtOhLrLe_aFdUINdCx(.Axl)l,R egdruocuep,( gTrRoEuEp,) ,S I M| P ^~~~~~~~~~~~~~~~~L E, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562M:u60l:S unote: mfield 'group' will be initialized after field 'stepSize', int3 2562_ | t ) | t^i d(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:)391,: 95n:t note: 
expanded from macro 'IMPL_COLL_FUNC' hrea d391s | ( n tRhurneWaodrsk)<,n ctcildFIunnBcl#o#cfku(ntch,r etaydpIed,x .Fxu)n,c #g#rdoeuvpr(egdroopu| , ^~~~~~~~~~~ NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> 
prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _MAX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkEleme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:)562;: 15 :| ^warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1 :562 | note: in instantiation of member function 'RunWork, 0, 2>::run' requested here ti d12( | tIiMdP)L,_ CnOtLhLr_eFaUdNsC((nAtlhlrReeaddusc)e,, tTiRdEIEn,B lSoIcMkP(LtEh,r ePardeIMduxl.Sxu)m,, gdroouubpl(e) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of 
member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nt().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives | , / * Dtiirde(ctti=d*)/,0 ,n 
tPhrroetaod,s (0n>t hprreiamdss ) ,| ^t idInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:o595c:k5(:t hnote: rin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heree adI d595x | . x ) , rgurnoTurpe(egUrpoDuopw)n,< T ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~R e d| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p , Pr o563t | o S i m psltee(>n(cacrlgSsh)m;e m .| c ^o mm.buff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:i202z:e53s:[ Nnote: Cin instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereC L_PR O202T | O _ S I M P L E ]R/uNnCWCoLr_kSETlEePmSe/nsti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:)275.:r90u:n (note: win instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ); | ^275 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp :P9r:i1m:i tnote: iin instantiation of member function 'RunWork, 0, 2>::run' requested herev ese,M u/l*SDuimr,e cuti=n*t/604,_ tP)r o t| o^, 0> /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:r391i:m95s: note: | expanded from macro 'IMPL_COLL_FUNC' ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :391595 | : 5 :R unote: nin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested hereW ork <595n | c c l F urnucn#T#rfeuenUcp,D otwynp1,, N1C>C>L(_aArLgGsO)_;# # a| l ^g o, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_202P:R53O:T Onote: _in instantiation of member function 'RunWorkElement, 0, 2>::run' requested 
here# #pro t202o | > ( ) . r u n ( &RnucncWloSrhkmEelme.mweonrtk<)F;n ,\ T ,| ^R edOp,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :A562l:g15o:, note: Pfield 'nthreads' will be initialized after field 'tidInBlock'r oto>( )562. | r u n ( wtei)d;( t i| d ^) , nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppe:a11d:s1(:n tnote: hin instantiation of member function 'RunWork, 0, 2>::run' requested herer eads )11, | ItMiPdLI_nCBOlLoLc_kF(UtNhCr(eAaldlIRdexd.uxc)e,, gTrRoEuEp,( gSrIoMuPpL)E,, P| r ^~~~~~~~~~~~~~~~~e MulS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:m562,: 60f:l onote: afield 'group' will be initialized after field 'stepSize't ) | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391t:i95d:( tnote: iexpanded from macro 'IMPL_COLL_FUNC'd ), nth r391e | a d sR(unntWhorreka,, N| C ^~~~~~~~~~~C L_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested 
here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ AX_DEV_ARITY>, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested 
here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, 
uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 
in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, 
/*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> 
prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: 
note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: 
note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:(562n:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d s), tidInBl o562c | k ( t h rteiadd(Itdixd.)x,) ,n tghrroeuapd(sg(rnotuhpr)e,a d s| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d InBlo c563k | ( t h r esatdeIpdSxi.zxe)(,n cgcrloSuhpm(egmr.ocuopm)m,. 
b u| f ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~f S i| z tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e s[NCC L563_ | P R O T Os_tSeIpMSPiLzEe](/nNcCcClLS_hSmTeEmP.Sc/osmimz.ebouff(fTS)i)z e{s [ N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C L _| P group(groupR OTO_SIMPLE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/:N275C:C90L:_ Snote: Tin instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereE PS/size o275f | ( T ) ) { P r| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m i t| i group(groupv es, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herem metr i324c | < N C C L _ MPArXi_mDiEtVi_vAeRsId,O p/,* DFiarneAcsty=m*m/e0t,r iPcrL _pMrAiXm_sD E V| _ ^A RITY>,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :/595*:D5i:r enote: cin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested heret =*/0 ,595 | P r o t or,u n0T>r eperUipmDso w n| < ^T , Red/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hO:p595,: 5P:r onote: tin instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested hereo Sim p595l | e < 1 , r1u>n>T(raeregUsp)D;o w n| < ^T , RedOp, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:r202o:t53o:S inote: min instantiation of member function 'RunWorkElement, 0, 2>::run' requested herep le<1, 2021 | > > ( a r g s ) ;R u n| W ^o rkElem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:n202t:<53F:n ,note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereT , R e202d 
| O p , A l g o ,R uPnrWootrok>E(l)e.mreunnt(, 0, 2>::run' requested here> ().ru n9( | wIeM)P;L _ C| O ^L L_FU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppN:C10(:A1l:l Rnote: ein instantiation of member function 'RunWork, 0, 2>::run' requested hered uc e10, | ITMRPELE_,C OSLILM_PFLUEN,C (PArlelMRueldSuucme,, uTiRnEtE6,4 _StI)M P L| E^, PreM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:l391S:u95m:, note: hexpanded from macro 'IMPL_COLL_FUNC'a lf) | ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:u391n:W95o:r knote: u,n cN#C#CdLe_vArLeGdOo_p#<#taylpgeo>,, NNCCCCLL__PARLOGTOO__####aplrgoot,o >N(C)C.Lr_uPnR(O&TnOc_c#l#Sphrmoetmo.>w(o)r.kr)u;n (\& n c| c ^l Shmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hw:o562r:k15):; note: \field 'nthreads' will be initialized after field 'tidInBlock' | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 15 : tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd (tid )562, | n t h rteiadd(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~u p(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:oup), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: 
in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> 
prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.worke); \ e U| p ^D own >562( | a r g s )t;i d (| t ^i d), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s202):,53 :t inote: din instantiation of member function 'RunWorkElement, 0, 2>::run' requested hereI nBlock( t202h | r e a d I d x . xR)u,n WgorrokuEpl(egmreonutp<)F,n , | T ^~~~~~~~~~~~~~~~~, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:d562O:p60,: Anote: lfield 'group' will be initialized after field 'stepSize'g o, P r562o | t o > ( )t.irdu(nt(iwde)),; n t| h ^r eads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cppe:a12d:s1):, note: tin instantiation of member function 'RunWork, 0, 2>::run' requested herei dInB l12o | cIkM(PtLh_rCeOaLdLI_dFxU.NxC)(,A lglrRoeudpu(cger,o uTpR)E,E , | S ^~~~~~~~~~~I MPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, 
FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:note: 562expanded from macro 'IMPL_COLL_FUNC': 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 391 | RunW o562r | k < n c ctliFdu(ntci#d#)f,u nnct,h rteyapdes,( nFtuhnrce#a#ddse)v,r etdiodpIk,( tNhCrCeLa_dAILdGxO._x#)#,a lggroo,u pN(CgCrLo_uPpR)O,T O _| # ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~# p r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t o>( 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> 
prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ).run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: 
in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: 
in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx908. 27 warnings generated when compiling for gfx941. 27 warnings generated when compiling for gfx940. 27 warnings generated when compiling for gfx90a. 27 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: 
note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 
595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: 
in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:275:90: note: in instantiation of member function 'Primitives, FanAsymmetric<2, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 275 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:324:90: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 2>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 324 | Primitives, /*Direct=*/0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:595:5: note: in instantiation of function template specialization '(anonymous namespace)::runTreeUpDown, ProtoSimple<1, 1>>' requested here 595 | runTreeUpDown>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 0, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 0, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, TREE, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 27 warnings generated when compiling for host. 
27 warnings generated when compiling for gfx1030. 27 warnings generated when compiling for gfx803. 27 warnings generated when compiling for gfx1102. 27 warnings generated when compiling for gfx1100. 27 warnings generated when compiling for gfx900. 27 warnings generated when compiling for gfx1101. 27 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o.d -o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 
| uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' 
set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 74%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; 
| ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused 
variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: 
warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 
'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 
386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, 
int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 
| RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will 
be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested hereMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 
| RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Symmetric<1>, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 
| RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nt, nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' 
requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 
17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, 
Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlocIn file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ k(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp 68: | 1 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :P10r: iIn file included from m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hi:t167i: v/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:s562<:T15,: Rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]d Op, FanSy m562m | e t r i ct(,t i0d,) ,P rnotthor,e a0d>s (pnrtihmrse a d| s ^) , tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hI:n588B:l5o:c knote: (in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested heret hre a588d | I d x .rxu)n,R ignrgo| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a rgs); 563 | | ^ stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hz:e202(:n53c:c lnote: Sin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereh mem. c202o | m m . 
b u f f S iRzuensW[oNrCkCELl_ePmReOnTtO<_FSnI,M PTL,E ]R/eNdCOCpL,_ SATlEgPoS,/ sPirzoetoof>((T)).)r u{n ( w| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) ; | group(group| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::168:: 56note: :in instantiation of member function 'RunWork, 1, 2>::run' requested here note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 4 | I68M | P L _ C OPLrLi_mFiUtNiCv(eAslo,d ,0 ,i nPtr8o_tto), 0| >^ prim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs: 391 :| 95 ^: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5 :391 | note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here Run W588o | r k < n crculnFRuinncg#<#Tf,u nRce,d Otpy,p eP,r oFtuon>c(#a#rdgesv)r;e d o| p ^< type>,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :N202C:C53L:_ Anote: Lin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereG O_## a202l | g o , N C C L _RPuRnOWToOr_k#E#lpermoetnot><(F)n.,r uTn,( &RnecdcOlpS,h mAelmg.ow,o rPkr)o;t o\> ( )| . ^r un(we/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):;562 : 15| : ^ note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4 :5621 | : note: in instantiation of member function 'RunWork, 1, 2>::run' requested here tid (4t | iIdM)P,L _nCtOhLrLe_aFdUsN(Cn(tAhlrleRaeddsu)c,e ,t iRdIINnGB,l oScIkM(PtLhEr,e aPdrIoddx,. 
xi)n,t 8g_rto)u p (| g^r oup)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391 :| 95 ^~~~~~~~~~~~~~~~~: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60 :391 | note: field 'group' will be initialized after field 'stepSize' RunWo r562k | < n c c ltFiudn(ct#i#df)u,n cn,t htryepaed,s (Fnutnhcr#e#addesv)r,e dtoipdc,k (NtChCrLe_aAdLIGdOx_.#x#)a,l ggor,o uNpC(CgLr_oPuRpO)T,O _ #| # ^~~~~~~~~~~p roto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, 
RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ric<1>, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 :R uwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]W orkEleme n562t | < F n , tTi,d (RteiddO)p,, nAtlhgroe,a dPsr(onttoh>r(e)a.drsu)n,( wtei)d;I n B| l ^o ck(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cppe:a9d:I1d:x .note: xin instantiation of member function 'RunWork, 1, 2>::run' requested here) , g r9o | uIpM(PgLr_oCuOpL)L,_ F U| N ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C ( A| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l Reduc e563, | R I N Gs,t eSpISMiPzLeE(,n cPcrloSdh,m eumi.ncto6m4m_.tb)u f f| S^i zes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C391C:L95_:P Rnote: Oexpanded from macro 'IMPL_COLL_FUNC'T O_SIM P391L | E ] /RNuCnCWLo_rSkT, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herep e>, N68C | C L _ A LPGrOi_m#i#tailvgeos,< TN,C CRLe_dPORpO,T OF_a#n#Spyrmomteot>r(i)c.n,( &0n,c cPlrSohtmoe,m .0w>o rpkr)i;m s\ | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::588562::515:: note: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<2, 2>>' requested herefield 'nthreads' will be initialized after field 'tidInBlock' 588 | 562 | r u ntRiidn(gth(raeragdss));, t| i ^d InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 
| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, 
Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> 
prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562#:#15p:r owarning: tinitializer order does not match the declaration order [-Wreorder-ctor]o >().run(& n562c | c l S h mteimd.(wtoirdk)),; n\t h r| e ^a ds(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d snote: )field 'nthreads' will be initialized after field 'tidInBlock', tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d I| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)B lock( t563h | r e a d Isdtxe.pxS)i,z eg(rnocucpl(Sghrmoeump.)c,o m m| . 
^~~~~~~~~~~~~~~~~b uffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:z562e:s60[:N Cnote: Cfield 'group' will be initialized after field 'stepSize'L _PROT O562_ | S I M P LtEi]d/(NtCiCdL)_,S TnEtPhSr/esaidzse(onft(hTr)e)a d{s ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| I group(groupn Block(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:x68.:x56):, note: gin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herer oup(g r68o | u p ) , P r| i ^~~~~~~~~~~m itives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 
0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 
| int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, in/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), titdI3n2B_lto)c k (| t^h readIdx.x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391u:p95(:g rnote: oexpanded from macro 'IMPL_COLL_FUNC'u p), | ^~~~~~~~~~~~~~~~~ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562R:u60n:W onote: rfield 'group' will be initialized after field 'stepSize'k i,d INnCBClLo_cAkL(GtOh_r#e#aadlIgdox,. 
xN)C,C Lg_rPoRuOpT(Og_r#o#uppr)o,t o >| ( ^~~~~~~~~~~) .run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::391562::9515:: note: warning: expanded from macro 'IMPL_COLL_FUNC'initializer order does not match the declaration order [-Wreorder-ctor] 391 | R562u | n W o r ktr,e 
aNdCICdLx_.AxL)G,O _g#r#oaulpg(og,r oNuCpC)L,_ P R| O ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~T O _| # tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# proto >563( | ) . r u ns(t&enpcScilzSeh(mnecmc.lwSohrmke)m;. c\o m m| . ^b uffSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h[:N562C:C15L:_ Pnote: Rfield 'nthreads' will be initialized after field 'tidInBlock'O TO_SIM P562L | E ] / N CtCiLd_(StTiEdP)S,/ snitzheroefa(dTs)()n t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d s )| , group(group tidInBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:t68h:r56e:a dnote: Iin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hered x.x), 68g | r o u p (Pgrriomuipt)i,v e s| < ^~~~~~~~~~~~~~~~~T , Re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:O562p:,60 :F anote: nfield 'group' will be initialized after field 'stepSize'S ymmet r562i | c < 1 > ,t i0d,( tPirdo)t,o ,n t0h>r epardism(sn t h| r ^e ads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i588d:I5n:B lnote: oin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested herec k(th r588e | a d I d xr.uxn)R,i nggr(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 17 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1102. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 66%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o.d 
-o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, 
int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 
'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 
'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, 562P | r o t o ,t i0d>( tpirdi)m,s n t| h ^r eads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h588r:e5a:d snote: )in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here, tid I588n | B l o c 
kr(utnhRrienagdo(uapr)g,s ) ;| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ^| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202: 53563: | note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here step S202i | z e ( n c c lS RhumneWmo.rckoEmlme.mbeunftfC(C)L._rSuTnE(PwSe/)s;i z e| o ^f (T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~6 : 1| : group(group note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:C68O:L56L:_ Fnote: Uin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereN C(All R68e | d u c e ,P rRiImNiGt,i vSeIsM, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h0:,391 :P95r:o tnote: oexpanded from macro 'IMPL_COLL_FUNC', 0> pr i391m | s R| u ^n Work, ProtoSimple<2, 2>>' requested here# fun c588, | t y p er,u nFRuinncg#<#Td,e vRreeddOopp,< tPyrpoet>o,> (NaCrCgLs_)A;L G O| _ ^# #algo, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:L202_:P53R:O Tnote: Oin instantiation of member function 'RunWorkElement, 1, 2>::run' requested here_ ##pro t202o | > ( ) . r u n ( &RnucncWloSrhkmEelme.mweonrtk<)F;n ,\ T ,| ^R edOp,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :A562l:g15o:, note: Pfield 'nthreads' will be initialized after field 'tidInBlock'r oto>( )562. 
| r u n ( wtei)d;( t i| d ^) , nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cppd:s8(:n1t:h rnote: ein instantiation of member function 'RunWork, 1, 2>::run' requested herea ds), 8t | iIdMIPnLB_lCoOcLkL(_tFhUrNeCa(dAIldlxR.exd)u,c eg,r oRuIpN(Gg,r oSuIpM)P,L E ,| ^~~~~~~~~~~~~~~~~M ax, i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t5626:460_:t )note: field 'group' will be initialized after field 'stepSize' | ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 : 95 :t inote: dexpanded from macro 'IMPL_COLL_FUNC'( tid), n391t | h r eRaudnsW(onrtkhu,p )N,C C L| _ ^~~~~~~~~~~A LGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 |
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 
| RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note:
in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, 
half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, 
Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 
| RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 
| Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :r562u:n15R:i nwarning: ginitializer order does not match the declaration order [-Wreorder-ctor]< T, RedO p562, | P r o ttoi>d((atrigds)),; n t| h ^r eads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a202d:s53):, note: tin instantiation of member function 'RunWorkElement, 1, 2>::run' requested herei dInBl o202c | k ( t h r e a d IRduxn.Wxo)r,k Eglreomuep(ngtrp(S)i.zreu(nn(cwcel)S;h m e| m ^. comm.buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cppf:S13i:z1e:s [note: Nin instantiation of member function 'RunWork, 1, 2>::run' requested hereC CL_P R13O | TIOM_PSLI_MCPOLLEL]_/FNUCNCCL(_ASlTlERPeSd/usciez,e oRfI(NTG),) S{I M P| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E , | M group(groupa x, rccl_b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hf:l68o:a56t:1 6note: )in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here | ^ 68 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :P391r:i95m:i tnote: iexpanded from macro 'IMPL_COLL_FUNC'v es#,f u0n,c ,P rtoytpoe,, 0F>u npcr#i#mdse v r| e ^d op5,: Nnote: Cin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested hereC L_ A588L | G O _ # #raulngRoi,n gNo(>a(r)g.sr)u;n ( &| n ^c clShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:.202w:o53r:k )note: ;in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here \ | 202 ^ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :R562u:n15W:o rnote: kfield 'nthreads' will be initialized after field 'tidInBlock'E lement <562F | n , T ,t iRde(dtOipd,) ,A lngtoh,r ePardost(on>t(h)r.eraudns()w,e )t;i d I| n ^B lock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp(:t9h:r1e:a dnote: Iin instantiation of member function 'RunWork, 1, 2>::run' requested hered x. x9) | ,I MgPrLo_uCpO(LgLr_oFuUpN)C,( A l| l ^~~~~~~~~~~~~~~~~R educ/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:,562 :R60I:N Gnote: ,field 'group' will be initialized after field 'stepSize' SIMPL E562, | M a x ,t iudi(ntti6d4)_,t )n t h| r^e ads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:n391t:h95r:e anote: dexpanded from macro 'IMPL_COLL_FUNC's ), tidI n391B | l o cRku(ntWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 67%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: 
note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested 
here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' 
requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 
| RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 
note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 
in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, 
double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: 
in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation 
of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 
2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: 
note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 588 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:68:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 68 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:588:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 588 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:109:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 109 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:5:1:7 warnings generated when compiling for gfx1030. 
note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Broadcast, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:109:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 109 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Broadcast, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr 
= recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set 
but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 
271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx1030. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for host. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:35:56: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 35 | Primitives, 0, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/alltoall_pivot.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllToAllPivot, RING, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- 
--offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gING, SIMPLE, Sum, 
int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for gfx900. 8 warnings generated when compiling for gfx941. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx1102. 8 warnings generated when compiling for gfx1101. 8 warnings generated when compiling for gfx908. 8 warnings generated when compiling for gfx940. 8 warnings generated when compiling for gfx803. 8 warnings generated when compiling for gfx1100. 8 warnings generated when compiling for gfx906. 8 warnings generated when compiling for gfx90a. 8 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:9: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:53:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 53 | prims(tid, nthreads, &ring->prev, &ring->next, inputBuf, outputBuf, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/broadcast.h:95:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 95 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Broadcast, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 8 warnings generated when compiling for host. 8 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: 
unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | 
^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ , flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | 
^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 78%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: 
warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; 
| ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: 
unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | 
^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: 
warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 
7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | 
^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 
--offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | 
^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | 
IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, 
args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 
7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- 
--offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 79%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 
| runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 
508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElementp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 
| RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 7 warnings generated when compiling for gfx941. stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 80%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] In file included from 514/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp | : 1 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hi:n10t: In file included from o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hf:f168s: e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.ht: 153=: 14t:i dwarning: ;unused variable 'data1' [-Wunused-variable] | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: 
warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: 
variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, datIn file included from a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp2:,1 : fIn file included from l/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.ha:g102: ;In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h :| 168 
^~~~~: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h::14153:: 21warning: :unused variable 'data1' [-Wunused-variable] warning: unused variable 'flag1' [-Wunused-variable] 153 | 153 | u i n t 3u2i_ntt 3d2a_tta 1d,a tfal1a,g 1f,l adga1t,a 2d,a tfal2a,g 2f;l a g| 2 ^~~~~; | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h ^~~~~: 153:28/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:: 153warning: :unused variable 'data2' [-Wunused-variable]21 : warning: unused variable 'flag1' [-Wunused-variable]153 | 153 | u i n t 3u2i_ntt 3d2a_tta 1d,a tfal1a,g 1f,l adga1t,a 2d,a tfal2a,g 2f;l a g| 2 ^~~~~; | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h ^~~~~: 153:35/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:: 153warning: :unused variable 'flag2' [-Wunused-variable]28 : warning: unused variable 'data2' [-Wunused-variable]153 | 153u | i n t 3 2u_itn td3a2t_at1 ,d aftlaa1g,1 ,f ldaagt1a,2 ,d aftlaa2g,2 ;f l a| g ^~~~~2 ; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | 
IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, 
FuncMax<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: 
note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 
507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | 
IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: 
note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkEle/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ ment().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | 
IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 
'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: 
unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, 
int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 :p rwarning: iinitializer order does not match the declaration order [-Wreorder-ctor]m s(tid-tidStart R562e | d u c e ,t indT(htrieda)d,s Rnetdhurceea,d sn(unltlhprtera,d s&)d,i rteicdtI-n>Boluotc,k (atrhgrse-a>dsIednxd.bxu)f,f ,g raorugps(-g>rroeucpv)b,u f f| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53s:t enote: pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereS ize( n202c | c l S h m e m . 
cRoumnmW.obrukfEflSeimzeenst[E(P)S./rsuinz(ewoef)(;T ) )| ^{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | :I626M:P9L:_ Cnote: Oin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL L_FUN C626( | A l l R e d u c ep,r iCmOsL(LtNiEdT-_tDiIdRSEtCaTr,t SScIaMtPtLeEr,, SnuTmhProesatdDsiSvc,a titnetr8,_ tN)U L L| ,^ direct-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h>:u391p:,95 :a rnote: gexpanded from macro 'IMPL_COLL_FUNC's ->sendbu f391f | , aRrugnsW-o>rrke, 2, 2>::run' requested heretc i#d#(d te202vi | rde )d ,o p n< tt hy rp eeRa>ud,ns W(NonCrtCkhLEr_leAeaLmdGesOn)_t,#< #Ftanil,dg IoTn,,B lNRoCecCdkLO(_ptP,hR rOAeTlaOgd_oI#,d# xpP.rrxoo)tt,oo >>g((r))o..urrpuu(nng((r&woneuc)pc;)l ,S h| m ^| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~m . w| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppk:)5 ;:563 1 | \: note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here ^ s tep S5/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi | :zI562e:M(15Pn:Lc _cnote: Clfield 'nthreads' will be initialized after field 'tidInBlock'OS LhLm_e Fm562U. 
| Nc Co (mA ml .ltbRiuedfd(futSciiedz,)e ,sC [OnNLtCLhCNrLEe_TaP_dRDsOI(TRnOEt_ChSTrI,eM aPSdLIsEM)]P,/L NEtC,iC dLSI_unSmBTlPEooPcsSkt/(Dstiihvzr,ee oaufdi(InTdt)x8)._ xt{)) , | g| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r^ o u| p group(group( g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391u:p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h95)::,677 :note: 11expanded from macro 'IMPL_COLL_FUNC'| : ^~~~~~~~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :391562 | : 60677 : | R unote: nfield 'group' will be initialized after field 'stepSize' W o r k 562< | n c cp lr iFtmuisnd(c(t#ti#idfd-u)tn,ic d,nS ttthayrrpeteaB,dc saF(suntnt,ch #rn#eTdahedrvser)ae,dd sBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ op, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h nthr:e562a:d15s:( nwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]h reads), tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~i dInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:c562k:(60t:h rnote: efield 'group' will be initialized after field 'stepSize'a dIdx.x) ,562 | g r o u pt(igdr(otuipd)),, n| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d s(n t563h | r e a d ss)t,e ptSiidzIen(BnlcocclkS(htmherme.acdoImdmx..bxu)f,f Sgirzoeusp[(NgCrCoLu_pP)R,O T O| _ ^~~~~~~~~~~S IMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ readIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:r562k:)15;: \warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d(tid )562, | n t h rteiadd(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p (| g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup), | 563 ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:t562e:p60S:i znote: efield 'group' will be initialized after field 'stepSize'( ncclS h562m | e m . 
c otmimd.(btu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hif:df562)S:,i15 z:ne tswarning: h[initializer order does not match the declaration order [-Wreorder-ctor]rN eCaCdLs_(PnRtOh Tr562Oe | _a Sd Is M) P,Lt Eit]di/(dNtICinCdBL)l_,oS cTnkEt(PhtSrh/ersaeidazsde(Iondftx(h.Trx)e))a, d {sg )r ,o| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tp i( dg| Ir group(groupno Bulpo)c,k/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h (: t687| h: ^~~~~~~~~~~r11 e:a dnote: Iin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered x.x), g687r | o u p ( g r o u p ) ,p r i| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s ( t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d -tidS t563a | r t B c asstte,p SniTzher(enacdcslBSchamsetm,. c&odmimr.ebcutf-f>Soiuzte,s [nNuClClLp_tPrR,O TaOr_gSsI-M>PsLeEn]d/bNuCfCfL,_ SaTrEgPsS-/>sriezcevobfu(fTf),) {| ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :655:11: 202note: | in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | R u n W o r k E l e mpernitme(a)d.srRuend(uwcee),; n u| l ^l ptr, &di/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppr:e4c:t1-:> onote: uin instantiation of member function 'RunWork, 2, 2>::run' requested heret , arg s4- | >IsMePnLd_bCuOfLfL,_ FaUrNgCs(-A>lrleRcevdbuucfef,, C O| L ^L 
NET_D/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:R202E:C53T:, note: Sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereI MPL E202, | S u m P o s t DRiuvn,W oirnktE8l_etm)e n t| <^F n, T,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :R391e:d95O:p ,note: expanded from macro 'IMPL_COLL_FUNC'A lgo, P r391o | t o >R(u)n.Wrournk(, 2, 2>::run' requested hereu nc## d5e | vIrMePdLo_pCU,N CN(CAClLl_RAeLdGuOc_e#,# aClOgLoL,N ENTC_CDLI_RPERCOTT,O _S#I#MpPrLoEt,o >S(u)m.Prousnt(D&invc,c luSihnmte8m_.tw)o r k| )^; \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ^391 :95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: note: 391field 'nthreads' will be initialized after field 'tidInBlock' | RunW o562r | k < n c ctliFdu(ntci#d#)f,u nnct,h rteyapdes,( nFtuhnrce#a#ddse)v,r etdiodpIk,( tNhCrCeLa_dAILdGxO._x#)#,a lggroo,u pN(CgCrLo_uPpR)O,T O _| # ^~~~~~~~~~~~~~~~~# prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:>562(:)60.:r unote: nfield 'group' will be initialized after field 'stepSize'( &nccl S562h | m e m . 
wtoirdk()t;i d\) , | n ^t hread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:(562n:t15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd s), t i562d | I n B l otcikd((tthirde)a,d Indtxh.rxe)a,d sg(rnotuhpr(eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hTO_#:#562p:r15o:t owarning: >initializer order does not match the declaration order [-Wreorder-ctor]( ).run(&ncclSh m562e | m . 
w o rtki)d;( t\i d )| , ^ nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:(15n:t hnote: rfield 'nthreads' will be initialized after field 'tidInBlock'e ads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o c k| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t hread I563d | x . x ) ,s tgerpoSuipz(eg(rnocucpl)S,h m e| m ^~~~~~~~~~~~~~~~~. co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:m562.:b60u:f fnote: Sfield 'group' will be initialized after field 'stepSize'i zes [562N | C C L _ PtRiOdT(Ot_iSdI)M,P LnEt]h/rNeCaCdLs_(SnTtEhPrSe/asdisz)e,o ft(iTd)I)n B{l o c| k ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( t h| r group(groupe adIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:u641p:(11g:r onote: uin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep ), | ^~~~~~~~~~~ 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ zeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tis(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562 :515 | :I Mwarning: Pinitializer order does not match the declaration order [-Wreorder-ctor]L _COLL_FUNC (562A | l l R e dtuicde(,t iCdO)L,L NnEtTh_rDeIaRdEsC(Tn,t hSrIeMaPdLsE),, StuimdPIonsBtlDoicvk,( tuhirneta8d_Itd)x . 
x| )^, gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:p391(:g95r:o unote: pexpanded from macro 'IMPL_COLL_FUNC') , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 391| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) RunW o563r | k < n c csltFeupnSci#z#ef(unnccc,l Sthympeem,. cFoumnmc.#b#udfefvSriezdeosp[P,R ONTCOC_LS_IAMLPGLOE_]#/#NaClCgLo_,S TNECPCSL/_sPiRzOeToOf_(#T#)p)r o{t o >| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) . r| u group(groupn (&ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hm:.641w:o11r:k )note: ;in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here \ | ^ 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' prim s562( | t i d - ttiiddS(ttairdt)R,e dnutcher,e andTsh(rnetahdrseRaeddsu)c,e ,t iddiIrneBclto-c>kd(otwhnr,e a&ddIidrxe.cxt)-,> ogurto,u pa(rggrso-u>ps)e,n d b| u ^~~~~~~~~~~~~~~~~f f, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:r562g:s60-:> rnote: efield 'group' will be initialized after field 'stepSize'c vbuf f562, | | ^ tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :n202t:h53r:e anote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres (nth r202e | a d s ) , t i dRIunnBWloorckkE(ltehmreenatd().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBloxck(threa)d,I dgxr.oxu)p,( ggrroouupp)(,g r o| u ^~~~~~~~~~~p ), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ llReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: 
note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: 562in instantiation of member function 'RunWork, 2, 2>::run' requested here | t5i | dI(MtPiLd_)C,O LnLt_hFrUeNaCd(sA(lnltRherdeuacdes,) ,C OtLiLdNIEnTB_lDoIcRkE(CtTh,r eSaIdMIPdLxE.,x )S,u mgProosutpD(igvr,o uupi)n,t 8 _| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h563: | 391 : 95 : snote: texpanded from macro 'IMPL_COLL_FUNC'e pSize( n391c | c l SRhumneWmo.rcko/,s iNzCeCoLf_(ATL)G)O _{# # a| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g o ,| group(groupN CCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hT:O626_:#9#:p rnote: oin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret o>().r u626n | ( & n c c l S h mpermi.mwso(rtki)d;- t\i d S| t ^a rtScat/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:e562r:,15 :n Tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eadsSca t562t | e r , NtUiLdL(,t iddi)r,e cntt-h>ruepa,d sa(rngtsh-r>esaednsd)b,u ftfi,d IanrBglso-c>kr(etchvrbeuafdfI,d x .| x ^) , grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:(202g:r53o:u pnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, | ^~~~~~~~~~~~~~~~~ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60 : Rnote: ufield 'group' will be initialized after field 'stepSize'n WorkE l562e | m e n t s()),. 
rtuind(IwneB)l;o c k| ( ^t hreadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppx:.6x:)1,: gnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested hereo up(g r6o | uIpM)P,L _ C| O ^~~~~~~~~~~L L_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsRedu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, 
int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, 
direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: note: field 'group' will be initialized after field 'stepSize' :562:15 :562 | warning: initializer order does not match the declaration order [-Wreorder-ctor] tid(tid), nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~) , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :R562u:n15W:o rwarning: kinitializer order does not match the declaration order [-Wreorder-ctor]< ncclFunc #562# | f u n c ,t itdy(ptei,d )F,u 
nnct#h#rdeeavdrse(dnotph),, NtCiCdLI_nABLlGoOc_k#(#tahlrgeoa,d INdCxC.Lx_)P,R OgTrOo_u#p#(pgrrootuop>)(,) . r| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n ( &| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)c clSh m563e | m . w o rskt)e;p S\i z e| ( ^n cclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:o562m:m15.:b unote: ffield 'nthreads' will be initialized after field 'tidInBlock'f Sizes[N C562C | L _ P R OtTiOd_(StIiMdP)L,E ]n/tNhCrCeLa_dSsT(EnPtSh/rseiazdeso)f,( Tt)i)d I{n B l| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c k (| t group(grouph readIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h.:x641):,11 :g rnote: oin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu p(gro u641p | ) , | ^~~~~~~~~~~~~~~~~ p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:i562m:s60(:t inote: dfield 'group' will be initialized after field 'stepSize'- tidSta r562t | R e d u ctei,d (ntTihdr)e,a dnstRherdeuacdes,( ndtihrreecatd-s>)d,o wtni,d I&ndBilroecckt(-t>horueta,d Iadrxg.sx-)>,s egnrdobuupf(fg,r oaurpg)s,- > r| e ^~~~~~~~~~~c vbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork655, NC:C11L:_ Anote: LGin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereO _##algo, N C655C | L _ P R O T O _ # # pprroitmos>((t)i.dr-utni(d&SntcacrltSRhemdeumc.ew,o rnkT)h;r e\a d s| R ^e duce/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:u15l:l pnote: tfield 'nthreads' will be initialized after field 'tidInBlock'r , &di r562e | c t - > otuitd,( tairdg)s,- >nstehnrdebaudfsf(,n tahrrgesa-d>sr)e,c vtbiudfIfn,B l o| c ^k (threadIdx.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:r202o:u53p:( gnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo up), 202| | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :R60u:n Wnote: ofield 'group' will be initialized after field 'stepSize'r kEleme n562t | < F n , tTi,d (RteiddO)p,, nAtlhgroe,a dPsr(onttoh>r(e)a.drsu)n,( wtei)d;I n B| l ^o ck(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppa:d5I:d1x:. 
xnote: )in instantiation of member function 'RunWork, 2, 2>::run' requested here, gro u5p | (IgMrPoLu_pC)O,L L _| F ^~~~~~~~~~~U NC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 
2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:s (warning: ninitializer order does not match the declaration order [-Wreorder-ctor]t hreads), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~~~~~~~B loc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:(562t:h60r:e anote: dfield 'group' will be initialized after field 'stepSize'I dx.x) ,562 | g r o u pt(igdr(otuipd)),, n| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d s(nth r563e | a d s ) ,s tteipdSIinzBel(oncckc(ltShhrmeeamd.Icdoxm.mx.)b,u fgfrSoiuzpe(sg[rNoCuCpL)_,P R O| T ^~~~~~~~~~~O _SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: 
note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in 
instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member 
function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 |
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, 
int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562::56215::15 :warning: initializer order does not match the declaration order [-Wreorder-ctor]note: field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: note: 563field 'group' will be initialized after field 'stepSize' | s t562e | p S i z et(indc(ctliSdh)m,e mn.tchormema.dbsu(fnftShirzeeasd[sN)C,C Lt_iPdRIOnTBOl_oScIkM(PtLhEr]e/aNdCICdLx_.SxT)E,P Sg/rsoiuzpe(ogfr(oTu)p)) ,{ | | ^~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562::56260::15 :note: field 'group' will be initialized after field 'stepSize'warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~, | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562v:b15u:f fwarning: ,initializer order does not match the declaration order [-Wreorder-ctor] | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 :t53i:d (note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei d), n t202h | r e a d s ( n t hRruenaWdosr)k,E lteimdeInntBr(o)u.pr)u,n ( w| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) ; | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp : 6s:t1e:p Snote: iin instantiation of 
member function 'RunWork, 2, 2>::run' requested herez e(ncc l6S | hImMePmL._cCoOmLmL._bFuUfNfCS(iAzlelsR[eNdCuCcLe_,P RCOOTLOL_NSEITM_PDLIER]E/CNTC,C LS_ISMTPELPES,/ sSiuzmePoofs(tTD)i)v ,{ i n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~3 2 _| t group(group) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:: 687note: :expanded from macro 'IMPL_COLL_FUNC'11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 391 | R687u | n W o r k < n c c l Fpurnicm#s#(ftuindc-,t itdySptea,r tFBucnacs#t#,d envTrherdeoapdt,, N&CdCiLr_eAcLtG-O>_o#u#ta,l gnou,l lNpCtCrL,_ PaRrOgTsO-_>#s#epnrdobtuof>f(,) .arrugns(-&>nrcecclvSbhumfefm,. w o| r ^k ); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h53::562 :note: 15in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: note: field 'nthreads' will be initialized after field 'tidInBlock' 202 | 562 | RtuindW(otrikdE)l,e mnetnhtr((t)h.rreuand(Iwdex).;x ) ,| ^g roup(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppr:o7u:p1):, note: in instantiation of member function 'RunWork, 2, 2>::run' requested here| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :7562 | :I60M:P Lnote: _field 'group' will be initialized after field 'stepSize'C O L562L | _ F U N Ct(iAdl(ltRiedd)u,c en,t hCrOeLaLdNsE(Tn_tDhIrReEaCdTs,) ,S ItMiPdLIEn,B lSoucmkP(otshtrDeiavd,I duxi.nxt)3,2 _gtr)o u p| (^g roup)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391 :| 95 ^~~~~~~~~~~: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hidInBl:o562c:k15(:t hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e adIdx.x), gro u562p | ( g r o utpi)d,( t i| d ^~~~~~~~~~~~~~~~~) , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r60e:a dnote: sfield 'group' will be initialized after field 'stepSize'( nth r562e | a d s ) ,t itdi(dtIindB)l,o cnkt(htrheraedasd(Indtxh.rxe)a,d sg)r,o utpi(dgIrnoBulpo)c,k ( t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I dx.x), 563g | r o u p (sgtreopuSpi)z,e ( n| c ^~~~~~~~~~~c lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, 
nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:m562e:m15.:w owarning: rinitializer order does not match the declaration order [-Wreorder-ctor]k ); \ | ^562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:(562t:i15d:) ,note: field 'nthreads' will be initialized after field 'tidInBlock'n thread s562( | n t h r etaidds()t,i dt)i,d InntBhlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utph)r,e a d| I ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d x .| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , gro u563p | ( g r o uspt)e,p S i| z ^~~~~~~~~~~~~~~~~e (ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:S562h:m60e:m .note: cfield 'group' will be initialized after field 'stepSize'o mm.buf f562S | i z e s [tNiCdC(Lt_iPdR)O,T On_tShIrMePaLdEs](/nNtChCrLe_aSdTsE)P,S /tsiidzIenoBfl(oTc)k)( t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d I d| x group(group. 
x), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:g677r:o11u:p )note: ,in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^~~~~~~~~~~ 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 15: warning: 202initializer order does not match the declaration order [-Wreorder-ctor] | Ru n562W | o r k E lteimde(nttid(I)n.Brluonc(kw(et)h;r e a| d ^I dx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp,: 7g:r1o:u pnote: (in instantiation of member function 'RunWork, 2, 2>::run' requested hereg roup )7, | I M| P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _ C| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L L_FUNC (563A | l l R e dsutceep,S iCzOeL(LnNcEcTl_SDhImReEmC.Tc,o mSmI.MbPuLfEf,S izeSsu[mNPCoCsLt_DPiRvO,T Ou_iSnItM3P2L_Et])/ N C| C^L _STE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:S391/:s95i:z enote: oexpanded from macro 'IMPL_COLL_FUNC'f (T)) {391 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R u n| W group(groupo rk, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep e, Func# #687d | e v r e d o p < t y pper>i,m sN(CtCiLd_-AtLiGdOS_t#a#ratlBgcoa,s 
tN,C CnLT_hPrReOaTdOs_B#c#apsrto,t o&>d(i)r.ercutn-(>&onuctc,l Snhumlelmp.twro,r ka)r;g s\- > s| e ^n dbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562a:r15g:s -note: >field 'nthreads' will be initialized after field 'tidInBlock'r ecvb u562f | f , | t ^i d(tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h202r:e53a:d snote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren thre a202d | s ) , t i d I nRBulnoWcokr(ktEhlreemaednItd/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:)562.:r60u:n (note: wfield 'group' will be initialized after field 'stepSize'e ); | ^562 | tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppd:)6,: 1n:t hnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree ads(n t6h | rIeMaPdLs_)C,O LtLi_dFIUnNBCl(oAclkl(Rtehdruecaed,I dCxO.LxL)N,E Tg_rDoIuRpE(CgTr,o uSpI)M,P L E| , ^~~~~~~~~~~ SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of 
member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15t:i dwarning: (initializer order does not match the declaration order [-Wreorder-ctor]t id), nth r562e | a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . 
x| ) ^~~~~~~~~~~~~~~~~, grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r60o:u pnote: )field 'group' will be initialized after field 'stepSize', | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 562 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | 563t | i d ( t istde)p,S inzteh(rnecacdlsS(hnmtehmr.ecaodmsm).,b utfifdSIinzBels[oNcCkC(Lt_hPrReOaTdOI_dSxI.MxP)L,E ]g/rNoCuCpL(_gSrToEuPpS)/,s i z| e ^~~~~~~~~~~o f(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:&562n:c15c:l Swarning: hinitializer order does not match the declaration order [-Wreorder-ctor]m em.wo r562k | ) ; \ t i| d ^( tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :n562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock's (nthr e562a | d s ) , ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u 
pt(igdrIonuBpl)o,c k (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d Idx.x )563, | g r o uspt(egprSoiuzpe)(,n c c| l ^~~~~~~~~~~~~~~~~S hmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:c562o:m60m:. bnote: ufield 'group' will be initialized after field 'stepSize'f fSize s562[ | N C C L _tPiRdO(TtOi_dS)I,M PnLtEh]r/eNaCdCsL(_nStThErPeSa/dssi)z,e otfi(dTI)n)B l{o c k| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa dIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 626g:r9o:u pnote: (in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg roup) ,626 | | ^~~~~~~~~~~ prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hdInBl:o562c:k15(:t hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e adIdx.x), gro u562p | ( g r o utpi)d,( t i| d ^~~~~~~~~~~~~~~~~) , n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e60a:d snote: (field 'group' will be initialized after field 'stepSize'n thre a562d | s ) , ttiiddI(ntBildo)c,k (ntthhrreeaaddIsd(xn.txh)r,e agdrso)u,p (tgirdoIunpB)l,o c k| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t h r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a dIdx.x )563, | g r o uspt(egprSoiuzpe)(,n c c| l ^~~~~~~~~~~S hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :g562r:o15u:p (warning: ginitializer order does not match the declaration order [-Wreorder-ctor]r oup), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hwarning: initializer order does not match the declaration order [-Wreorder-ctor] :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ g r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u p(gr o563u | p ) , s| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e p S| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)z e(ncclS h563m | e m . 
c osmtme.pbSuifzfeS(inzcecsl[SNhCmCeLm_.PcRoOmTmO._bSuIfMfPSLiEz]e/sN[CNCCLC_LS_TPERPOST/Os_iSzIeMoPfL(ET])/)N C{C L _| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T E P| S group(group/ sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~641 : 11| : group(group note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 641 :p11r:i mnote: sin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( tid-tid S641t | a r t R e d u c e , pnrTihmrse(atdisdR-etdiudcSet,a rdtiRreedcutc->ed,o wnnT, h&rdeiardecstR-e>douucte,, adrigrs-e>cste-n>ddbouwfnf,, &adrigrse-c>tr-e>covubtu,f fa,r g s| - ^> sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:f202f:,53 :a rnote: gin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres ->r e202c | v b u f f , | R ^u nWorkElemen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:<202F:n53,: Tnote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | R e d O p , ARlugnoW,o Prroto>().ruknE(lweem)e;n t <| F ^n , T, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppR:e8d:O1p:, note: Ain instantiation of member function 'RunWork, 2, 2>::run' requested herel go, 8P | rIoMtPoL>_(C)O.LrLu_nF(UwNeC)(;A l l| R ^e duce, COLLNE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppT:_7D:I1R:E Cnote: Tin instantiation of member function 'RunWork, 2, 2>::run' requested here, SIMPL E7, | 
ISMuPmLP_oCsOtLDLi_vF,U NiCn(tA6l4l_Rte)d u c| e^, CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:L391:N95E:T _note: Dexpanded from macro 'IMPL_COLL_FUNC'I RECT ,391 | S I MRPuLnEW,o rSku, NCCL _391A | L G OR_u#n#Waolrgko<,n cNcClCFLu_nPcR#O#TfOu_n#c#,p rtoytpoe>,( )F.urnucn#(#&ndcecvlrSehdmoepm<.twyoprek>),; N\C C L| _ ^A LGO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:a562l:g15o:, note: Nfield 'nthreads' will be initialized after field 'tidInBlock'C CL_ P562R | O T O _ #t#ipdr(ottiod>)(,) .nrtuhnr(e&andcsc(lnSthhmreema.dwso)r,k )t;i d\I n B| l ^o ck(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:d562x:.15x:) ,note: field 'nthreads' will be initialized after field 'tidInBlock'g roup(gr o562u | p ) , t| i ^~~~~~~~~~~~~~~~~d (t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562):,60 :n tnote: hfield 'group' will be initialized after field 'stepSize' rea d562s | ( n t h rteiadd(st)i,d )t,i dnItnhBrleoacdks((tnhthrreeaaddIsd)x,. xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~~~~~~~. 
x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:(60g:r onote: ufield 'group' will be initialized after field 'stepSize'p ), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uin/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht64_t:)562 : 15| :^ warning: initializer order does not match the declaration order [-Wreorder-ctor] 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | RtuindW(otrikd<)n,c cnltFhurneca#d#sf(unntch,r etaydpse),, FtuindcI#n#Bdleovcrke(dtohprx,. xN)C,C Lg_rAoLuGpO(_g#r#oaulpg)o,, N| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C L _| P tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)R OTO_ #563# | p r o t os>t(e)p.Sriuzne((&nnccccllSShhmmeemm..cwoomrmk.)b;u f\f S i| z ^e s[NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:P562R:O15T:O _note: Sfield 'nthreads' will be initialized after field 'tidInBlock'I MPLE] /562N | C C L _ StTiEdP(St/isdi)z,e onft(hTr)e)a d{s ( n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h641r:e11a:d Inote: din instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herex .x), g r641o | u p ( g r o u p ) , p r| i ^~~~~~~~~~~~~~~~~m s(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562-:t60i:d Snote: tfield 'group' will be initialized after field 'stepSize'a rtRe d562u | c e , ntTihdr(etaidds)R,e dnutcher,e addisr(enctth-r>edaodwsn),, &tdiidrIencBtl-o>coku(tt,h raeragdsI-d>xs.exn)d,b ugfrfo,u pa(rggrso-u>pr)e,c v b| u ^~~~~~~~~~~f f, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthr_eadsS)T,E PtSi/dsIinzBeloofc(kT()t)h r{e a d| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d x .| x group(group) , group(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 641| : ^~~~~~~~~~~~~~~~~11 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: note: 641field 'group' will be initialized after field 'stepSize' | 562 | p r itmisd((ttiidd-)t,i dnSttharretaRdesd(uncteh,r enaTdhsr)e,a dtsiRdeIdnuBcleo,c kd(itrherceta-d>Iddoxw.nx,) ,& dgirroeucpt(-g>roouutp,) ,a r g| s ^~~~~~~~~~~- >sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15(:t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]) , nthread s562( | n t h r etaidds()t,i dt)i,d InntBhlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utph)r,e a d| I ^~~~~~~~~~~d x.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ C(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:O562_:S15I:M Pwarning: Linitializer order does not match the declaration order [-Wreorder-ctor]E ]/NCCL_S T562E | P S / s itziedo(ft(iTd))), {n t h| r 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| s group(group( nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :t641i:d11I:n Bnote: lin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo ck(thr e641a | d I d x . x ) , g rporuipm(sg(rtoiudp-)t,i d S| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a r t| R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e duce, 563n | T h r e asdtseRpeSdiuzcee(,n cdcilrSehcmte-m>.dcoowmnm,. b&udfifrSeiczte-s>[oNuCtC, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:L562E:]15/:N Cwarning: Cinitializer order does not match the declaration order [-Wreorder-ctor]L _STEPS/s i562z | e o f ( Tt)i)d ({t i d| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, n| t group(grouph reads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 677t:i11d:I nnote: Bin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel ock(th r677e | a d I d x . 
x ) , gprroiumps((gtriodu-pt)i,d S t| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r t B| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a st, n T563h | r e a d ssBtceapsSti,z e&(dnicrcelcSth-m>eomu.tc,o mdmi.rbeucftf-S>idzoewsn[,N CaCrLg_sP-R>OsTeOn_dSbIuMfPfL,E ]a/rNgCsC-L>_rSeTcEvPbSu/fsfi,z e o| f ^( T)) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 202 :| 53 group(group: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h202: | 677 : 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here RunWo r677k | E l e m e n t < F n ,p rTi,m sR(etdiOdp-,t iAdlSgtoa,r tPBrcoatsot>,( )n.Trhurne(awdes)B;c a s| t ^, &direct/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp-:>9o:u1t:, note: direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInin instantiation of member function 'RunWork, 2, 2>::run' requested hereBlock (threadId x9. | xI)M,P Lg_rCoOuLpL(_gFrUoNuCp()A,l l R| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d u c| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), CO L563L | N E T _ DsItReEpCSTi,z eS(InMcPcLlES,h mSeumm.PcoosmtmD.ibvu,f fuSiinzte6s4[_NtC)C L _| P^R OTO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:L391E:]95/:N Cnote: Cexpanded from macro 'IMPL_COLL_FUNC'L _STEPS/ s391i | z e oRfu(nTW)o)r k{< n c| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l F u| n group(groupc ##func, t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hy:p641e:,11 :F unote: nin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec ##dev r641e | d o p < t y p e > , pNrCiCmLs_(AtLiGdO-_t#i#daSltgaor,t RNeCdCuLc_eP,R OnTTOh_r#e#apdrsoRteod>u(c)e.,r udni(r&encctc-l>Sdhomwenm,. 
w&odrikr)e;c t\- > o| u ^t , args->se/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:d562b:u15f:f ,note: field 'nthreads' will be initialized after field 'tidInBlock'a rgs->rec v562b | u f f , t i| d ^( tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :n202t:h53r:e anote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres (nt h202r | e a d s ) , t iRduInnWBolrokcEkl(etmherneta ().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562(:w60e):; note: field 'group' will be initialized after field 'stepSize'| ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp | : 9 : 1 :t inote: din instantiation of member function 'RunWork, 2, 2>::run' requested here( tid )9, | InMtPhLr_eCaOdLsL(_nFtUhNrCe(aAdlsl)R,e dtuicdeI,n BClOoLcLkN(EtTh_rDeIaRdEICdTx,. xS)I,M PgLrEo,u pS(ugmrPoouspt)D,i v ,| ^~~~~~~~~~~u int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
L_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] :562:15: 562warning: | initializer order does not match the declaration order [-Wreorder-ctor] ti d562( | t i d ) ,t indt(htrieda)d,s (nntthhrreeaaddss()n,t htriedaIdnsB)l,o ctki(dtIhnrBelaodcIkd(xt.hxr)e,a dgIrdoxu.px()g,r ogurpo)u,p ( g| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o u p| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 563| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) st e563p | S i z e (sntcecplSSihzmee(mn.cccolmSmh.mbeumf.fcSoimzme.sb[uNfCfCSLi_zPeRsO[TNOC_CSLI_MPPRLOET]O/_NSCICMLP_LSET]E/PNSC/CsLi_zSeToEfP(ST/)s)i z{e o f| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T ) )| group(group{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 626655: | 9 : note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here p626r | i m s ( t i d - tpirdiSmtsa(rttiRde-dtuicdeS,t anrTthSrceaatdtseRre,d uncTeh,r enaudlslSpctart,t e&rd,i rNeUcLtL-,> oduitr,e catr-g>su-p>,s eanrdgbsu-f>fs,e nadrbgusf-f>,r eacrvgbsu-f>fr,e c v| b ^u ff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 202in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 202 | R u n W oRruknEWloermkeEnlte,( )P.rroutno(>w(e)).;r u n| ( ^w e); | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp :9:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppnote: :in instantiation of member function 'RunWork, 2, 2>::run' requested here9 :1: note: 9in instantiation of member function 'RunWork, 2, 2>::run' requested here | IMPL _9C | OILMLP_LF_UCNOCL(LA_lFlURNeCd(uAclel,R eCdOuLcLeN,E TC_ODLILRNEECTT_,D ISRIEMCPTL,E ,S ISMuPmLPEo,s tSDuimvP,o sutiDnitv6,4 _uti)n t 6| 4^_ t) | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 391expanded from macro 'IMPL_COLL_FUNC': 95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | R391u | n W oRruknt,y pNeC>C,L _NACLCGLO__A#L#GaOl_g#o#,a lNgCoC,L _NPCRCOLT_OP_R#O#TpOr_o#t#op>r(o)t.or>u(n)(.&rnucnc(l&SnhcmcelmS.hwmoermk.)w;o r\k ) ;| ^\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562field 'nthreads' will be initialized after field 'tidInBlock': 15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~) , | 
^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :field 'group' will be initialized after field 'stepSize'60 : note: field 'group' will be initialized after field 'stepSize' 562 | t i562d | ( t i d )t,i dn(tthirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~u p), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ #func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 687 
| prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: 562note: | in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tid( t677i | d ) , n t h r e a dpsr(inmtsh(rteiadd-st)i,d SttiadrItnBBclaosctk,( tnhTrheraedaIddsxB.cxa)s,t ,g r&oduipr(egcrto-u>po)u,t , | d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i r e| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t ->do w563n | , a r gsst-e>psendSbiuzfef(,n cacrlgSsh-m>erme.ccvobmumf.fb,u f f| S ^izes[NCCL_ PROTO_SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:E202]:/53N:C Cnote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here_ STEPS/ s202i | z e o f ( T ) ) R{u n W| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r k E| l group(groupe ment, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, Alg o687, | P r o t o > ( ) .prruinm(sw(et)i;d - t| i ^d StartBcas/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppt:,9 :n1T:h rnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herea dsBcas t9, | I&MdPiLr_eCcOtL-L>_oFuUtN,C (nAullllRpetdru,c ea,r gCsO-L>LsNeEnTd_bDuIfRfE,C Ta,r gSsI-M>PrLeEc,v bSuufmfP,o s t| D ^i v, 
ui/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t2026:453_:t )note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 :R unote: nexpanded from macro 'IMPL_COLL_FUNC'W orkElemen t391< | F n ,R uTn,W oRrekd (t)y.preu,n (Fwuen)c;# # d| e ^v redo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppp:<9t:y1p:e >note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here NCC L9_ | AILMGPOL__#C#OaLlLg_oF,U NNCC(CALl_lPRReOdTuOc_e#,# pCrOoLtLoN>E(T)_.DrIuRnE(C&Tn,c cSlISMhPmLeEm,. wSourmkP)o;s t\D i v| , ^ uint64_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:)562 : 15| :^ note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 562expanded from macro 'IMPL_COLL_FUNC' | t391i | d ( tRiudn)W,o rnkt, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ 43 warnings generated when compiling for gfx908. 43 warnings generated when compiling for gfx940. 43 warnings generated when compiling for gfx941. 43 warnings generated when compiling for gfx90a. 43 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, 
&direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: 
in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for gfx906. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:z562e:(15n:c cwarning: linitializer order does not match the declaration order [-Wreorder-ctor]S hmem.com m562. | b u f f Stiizde(st[iNdC)C,L _nPtRhOrTeOa_dSsI(MnPtLhEr]e/aNdCsC)L,_ StTiEdPISn/Bsliozceko(ft(hTr)e)a d{I d x| . 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x ) ,| group(groupg roup(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~666 : 9| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 563 | 666 | s t e p S i z e (pnrcicmlsS(htmiedm,. cnoTmhmr.ebaudfsfGSaitzheesr[,N CdCiLr_ePcRtO-T>Ou_pS,I MNPULLEL],/ NaCrCgLs_-S>TsEePnSd/bsuifzfe,o fa(rTg)s)- >{r e c| v ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~b u f| f group(group, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::687202::1153:: note: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herein instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202687 | | R u npWroirmksE(lteimde-ntti,( )&.driurne(cwte-)>;o u t| , ^ nullp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cppt:r9,: 1a:r gnote: sin instantiation of member function 'RunWork, 2, 2>::run' requested here- >send b9u | fIfM,P La_rCgOsL-L>_rFeUcNvCb(uAflfl,R e d| u ^c e, COLLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:T202_:D53I:R Enote: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereT , SI M202P | L E , S u m P oRsutnDWiovr,k Eulienmte6n4t_ (391) | . 
r uRnu(nwWeo)r;k < n| c ^c lFunc#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp#:f6u:n1c:, note: tin instantiation of member function 'RunWork, 2, 2>::run' requested herey pe, F u6n | cI#M#PdLe_vCrOeLdLo_pFl,l RNeCdCuLc_eA,L GCOO_L#L#NaElTg_oD,I RNECCCTL,_ PSRIOMTPOL_E#,# pSruomtPoo>s(t)D.irvu,n (i&nntc3c2l_Sth)m e m| .^w ork)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h;: 391\: 95 :| ^note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :39115 | : note: Rfield 'nthreads' will be initialized after field 'tidInBlock'u nWork< n562c | c l F u ntci#d#(ftuindc),, tnytpher,e aFdusn(cn#t#hdreevardesd)o,p B,l oNcCkC(Lt_hArLeGaOd_I#d#xa.lxg)o,, gNrCoCuLp_(PgRrOoTuOp_)#,# p r| o ^~~~~~~~~~~~~~~~~t o>(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):.562r:u60n:( ¬e: nfield 'group' will be initialized after field 'stepSize'c clShm e562m | . 
w o r kt)i;d (\t i d| ) ^, nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s15(:n tnote: hfield 'nthreads' will be initialized after field 'tidInBlock'r eads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~o ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 43 warnings generated when compiling for host. 43 warnings generated when compiling for gfx900. 43 warnings generated when compiling for gfx1101. 43 warnings generated when compiling for gfx803. 43 warnings generated when compiling for gfx1100. 43 warnings generated when compiling for gfx1030. 43 warnings generated when compiling for gfx1102. 43 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset 
= tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 
0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:94:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 94 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(Reduce, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ N, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of 
member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 13 warnings generated when compiling for gfx90a. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx900. 13 warnings generated when compiling for gfx803. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 81%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 
| uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing<T, RedOp, Proto>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' 
requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 
1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, 
RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 
80 | runRing562(:a15r:g swarning: )initializer order does not match the declaration order [-Wreorder-ctor]; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562: | 202 : 53 : tnote: iin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hered (tid )202, | n t h r e a d sR(unntWhorrekaEdlse)m,e nttiu(p)(.grruonu(pw)e,) ; | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, 
args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ing->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreadidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 
| RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562a:r15g:s -warning: >initializer order does not match the declaration order [-Wreorder-ctor]r ecvbuff, ar g562s | - > r e dtOipdA(rtgi,d )0,, natrhgrse-a>dcso(nnntIhnrdeeaxd,s )a,r gtsi-d>IcnoBnlnoIcnkd(etxh)r;e a d| I ^d x.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :g80r:o5u:p (note: gin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested herer oup )80, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ r u| n 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)R ing< T563, | R e d Ospt,e pPSriozteo(>n(cacrlgSsh)m;e m .| c ^o mm.buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:S202i:z53e:s [note: Nin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereC CL_P R202O | T O _ S I M P L ER]u/nNWCoCrLk_ESlTeEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ment().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork , N CtCiLd_(AtLiGdO)_,# #natlhgroe,a dNsC(CnLt_hPrReOaTdOs_)#,# ptriodtIon>B(l)o.crku(nt(h&rnecacdlISdhxm.exm).,w ogrrko)u;p (\g r o| u ^p ), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 562 :| 15 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): note: field 'nthreads' will be initialized after field 'tidInBlock' 563 | 562 | s t e ptSiidz(et(indc)c,l Snhtmherme.acdosm(mn.tbhurfefaSdisz)e,s [tNiCdCILn_BPlRoOcTkO(_tShIrMePadILdEx]./xN)C,C Lg_rSoTuEpP(Sg/rsoiuzpe)o,f ( T| ) ^~~~~~~~~~~~~~~~~) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 562 :| 60 group(group: note: field 'group' will be initialized after field 'stepSize' 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h562: | 34 : 7 : tnote: iin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered (tid), 34n | t h r e a d sp(rnitmhsr(etaidds,) ,n tthirdeIandBsl,o c&kr(itnhgr-e>apdrIedvx,. x&)r,i nggr-o>unpe(xgtr,o uapr)g,s - >| s ^~~~~~~~~~~e ndbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, 
FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/:N562C:C15L:_ Swarning: Tinitializer order does not match the declaration order [-Wreorder-ctor]E PS/sizeo f562( | T ) ) {t i d| ( 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| ) group(group, nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hs:(34n:t7h:r enote: ain instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered s), ti d34I | n B l o c k (ptrhirmesa(dtIiddx,. xn)t,h rgeraodusp,( g&rroiunpg)-,> p r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~v , | & tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r ing-> n563e | x t , asrtgesp-S>iszeen(dnbcucflfS,h maermg.sc-o>mrme.cbvubfuffSfi,z easr[gNsC-C>Lr_ePdROOTpOA_rSgI,M P0L,E ]a/rNgCsC-L>_cSoTnEnPISn/dseixz,e oafr(gTs)-)> c{o n n| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n d e| x group(group) ; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h7::80 :note: 5in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 3480 | | r upnrRiimnsg(r(ianrgg-s>)p;r e v| , ^ &ring-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h>:n202e:x53t:, note: ain instantiation of member function 'RunWorkElement, 1, 2>::run' requested herer gs->s e202n | d b u f f , a rRgusn-W>orrekcEvlbeumfefn,t ,r eRdeOdpOApr,g ,A l0g,o ,a rPgrso-t>oc>o(n)n.Irnudne(xw,e )a;r g s| - ^> connIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cppd:e9x:)1;: note: | in instantiation of member function 'RunWork, 1, 2>::run' requested here ^ 9 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hI:M80P:L5_:C 
Onote: Lin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested hereL _FU N80C | ( R e d urcuen,R iRnIgNu(ianrtg6s4)_;t ) | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::391202::9553:: note: note: expanded from macro 'IMPL_COLL_FUNC'in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202391 | | R u n W o rRkuy(p)e.>r,u nN(CwCeL)_;A L G| O ^_ ##algo,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp :N9C:C1L:_ Pnote: Rin instantiation of member function 'RunWork, 1, 2>::run' requested hereO TO_# #9p | rIoMtPoL>_(C)O.LrLu_nF(U&NnCc(cRleSdhumceem,. wRoIrNkG),; S\I M P| L ^E , Sum/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562u:i15n:t 6note: 4field 'nthreads' will be initialized after field 'tidInBlock'_ t) | ^562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t391i:d95(:t inote: dexpanded from macro 'IMPL_COLL_FUNC') , nthr e391a | d s (RnutnhWroerakd , | N ^~~~~~~~~~~~~~~~~C CL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:L562G:O60_:# #note: afield 'group' will be initialized after field 'stepSize'l go, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | 
prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NC C562L | _ A L G Ot_i#d#(atligdo),, NnCtChLr_ePaRdOsT(On_t#h#rperaodtso)>,( )t.irduInn(B&lnoccckl(Sthhmreema.dwIodrxk.)x;) ,\ g r| o ^u p(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562):,15 : | note: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~field 'nthreads' will be initialized after field 'tidInBlock' | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i d (sttiedp)S,i ntzher(enacdcsl(Snhtmherme.acdosm)m,. 
btuifdfISniBzleosc[kN(CtChLr_ePaRdOITdOx_.SxI)M,P LgEr]o/uNpC(CgLr_oSuTpE)P,S / s| i ^~~~~~~~~~~~~~~~~z eof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:T562):)60 :{ note: field 'group' will be initialized after field 'stepSize'| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hi:d34(:t7i:d )note: ,in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here nthread s34( | n t h r e a dpsr)i,m st(itdiIdn,B lnotchkr(etahdrse,a d&Irdixn.gx-)>,p rgervo,u p&(rgirnogu-p>)n,e x t| , ^~~~~~~~~~~ args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' 
requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 
17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 
'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, 
RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 75%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 
| int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, 
Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, 
Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 
| RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:6:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for host. 
17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested 
here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hdI:n562B:l15o:c kwarning: (initializer order does not match the declaration order [-Wreorder-ctor]t hreadIdx.x )562, | g r o upt(igdr(otuipd)),, n| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d s(nth r563e | a d s ) ,s tteipdSIinzBel(oncckc(ltShhrmeeamd.Icdoxm.mx.)b,u fgfrSoiuzpe(sg[rNoCuCpL)_,P R O| T ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~O _ S| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)M PLE] /563N | C C L _ SsTtEePpSS/isziez(enocfc(lTS)h)m e{m . 
c| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m m .| b group(groupu ffSizes[N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hC:C34L:_7P:R Onote: Tin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereO _SIMPL E34] | / N C C L _ SpTrEiPmSs/(stiizde,o fn(tTh)r)e a{d s ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~& r i| n group(groupg ->prev, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h&:r34i:n7g:- >note: nin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree xt, arg s34- | > s e n d b upfrfi,m sa(rtgisd-,> rnetchvrbeuafdfs,, a&rrgisn-g>-r>epdrOepvA,r g&,r i0n,g -a>rngesx-t>,c oanrngIsn-d>esxe,n dabrugfsf-,> caorngnsI-n>dreexc)v;b u f| f ^, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hs:-80>:r5e:d Onote: pin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested hereA rg, 800 | , a r grsu-n>Rcionngn(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34C:7: (note: Rin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree duce, R34I | N G , S I MpPrLiEm,s (Ptriodd,, nitnhtr6e4a_dts), &| r^i ng->prev, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h&:r391i:n95g:- >note: nexpanded from macro 'IMPL_COLL_FUNC'e xt, args- >391s | e n dRbuunfWfo,r kaFruenccv#b#uffufn,c ,a rtgysp-e>,r eFduOnpcA#r#gd,e v0r,e daorpgpceo>n,n INnCdCeLx_,A LaGrOg_s#-#>aclognon,I nNdCeCxL)_;P R O| T ^O _##pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.ht:o80>:(5):. rnote: uin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested heren (&n c80c | l S h m ermu.nwRoirnkg)<;T ,\ R e| d ^O p, Proto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>:(562a:r15g:s )note: ;field 'nthreads' will be initialized after field 'tidInBlock' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202t:i53d:( tnote: iin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hered ), n t202h | r e a d s ( n t hRruenaWdosr)k,E lteimdeInntBr(o)u.pr)u,n ( w| e ^~~~~~~~~~~~~~~~~) ; | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cppnote: :field 'group' will be initialized after field 'stepSize'5 :1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 562 | 
5 | ItMiPdL(_tCiOdL)L,_ FnUtNhCr(eRaeddsu(cnet,h rReIaNdGs,) ,S ItMiPdLIEn,B lPorcokd(,t hurienatd8I_dtx). x )| ,^ grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:(391g:r95o:u pnote: )expanded from macro 'IMPL_COLL_FUNC', | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: 
note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | 
IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ MPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 
'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | 
IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x),
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 |
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 
0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, 
NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r15o:u pwarning: )initializer order does not match the declaration order [-Wreorder-ctor], | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :56260 | : note: field 'group' will be initialized after field 'stepSize' tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NUNCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, 
NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h15::562 :warning: 60initializer order does not match the declaration order [-Wreorder-ctor]: note: field 'group' will be initialized after field 'stepSize' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthreads)h,r etaiddsI)n,B ltoicdkI(ntBhlroecakd(Itdhxr.exa)d,I dgxr.oxu)p,( ggrroouupp)(,g r o| u ^~~~~~~~~~~p ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562s:t15e:p Swarning: iinitializer order does not match the declaration order [-Wreorder-ctor]z e(ncclShm e562m | . c o m mt.ibdu(ftfiSdi)z,e sn[tNhCrCeLa_dPsR(OnTtOh_rSeIaMdPsL)E,] /tNiCdCILn_BSlToEcPkS(/tshirzeeaodfI(dTx).)x ){, g| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o u p| ( group(groupg roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :666:9: 563note: | in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here st e666p | S i z e ( n c c lpSrhimmesm(.tciodm,m .nbTuhfrfeSaidzseGsa[tNhCeCrL,_ PdRiOrTeOc_tS-I>MuPpL,E ]N/UNLCLC,L _aSrTgEsP-S>/sseinzdebouff(fT,) )a r{g s -| > ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e c| v group(groupb uff, | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53 :666 | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202p | r i m s ( t i d ,R unnTWhorrekaEdlseGmaetnhteOupp,, ANlUgLoL,, Parrogtso->>(s)e.nrdubnu(fwfe,) ;a r g| s ^- >recvbu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppf:f4,: 1 :| ^note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h4: | 202I:M53P:L _note: Cin instantiation of member function 
'RunWorkElement, 2, 2>::run' requested hereO LL_F U202N | C ( A l l R e d uRcuen,W oCrOkLELlNeEmTe_nDtI ( )| .^r un(we/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):;391 : 95| : ^ note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4: 1391: | note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereR unW o4r | kII,M PNLCEC,L _SAuLmG,O _i#n#ta8l_gto), N| C^C L_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:T391O:_95#:# pnote: roto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsRedu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hce, nu:l562l:p15t:r ,warning: initializer order does not match the declaration order [-Wreorder-ctor]& direct->out, a r562g | s - > s etniddb(utfifd,) ,a rngtsh-r>eraedcsv(bnutfhfr,e a d| s ^) , 
tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:B202l:o53c:k (note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereh rea d202I | d x . x ) , g rRouunpW(ogrrkoEulpe)m,e n t| < ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~F n ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T , RedO p563, | A l g os,t ePprSoitzoe>((n)c.crluSnh(mweem).;c o m| m ^. buffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppi:z5e:s1[:N Cnote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested hereL _PR O5T | OI_MSPILM_PCLOEL]L/_NFCUCNLC_(SATlElPRSe/dsuiczee,o fC(OTL)L)N E{T _ D| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R E C| T group(group, SIMPLE, Sum, u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:n677t:811_:t )note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h677: | 391 : 95 : note: expanded from macro 'IMPL_COLL_FUNC' pr i391m | s ( tRiudn-WtoirdkSoopur,e cNtC-C>Ld_oAwLnG,O _a#r#gasl-g>os,e nNdCbCuLf_fP,R OaTrOg_s#-#>prreoctvob>u(f)f.,r u n| ( ^& ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:w202o:r53k:) ;note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here\ | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 :R unote: nfield 'nthreads' will be initialized after field 'tidInBlock'W orkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member 
function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 
1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562N:C15C:L _warning: Pinitializer order does not match the declaration order [-Wreorder-ctor]R OTO_##pro t562o | > ( ) . rtuind((&tnicdc)l,S hnmtehmr.ewaodrsk()n;t h\r e a| d ^s ), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562n:B15l:o cnote: kfield 'nthreads' will be initialized after field 'tidInBlock'( threa d562I | d x . x )t,i dg(rtoiudp)(,g rnotuhpr)e,a d s| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ads), 563t | i d I n BsltoecpkS(itzher(enacdcIldSxh.mxe)m,. cgormomu.pb(ugfrfoSuipz)e,s [ N| C ^~~~~~~~~~~~~~~~~C L_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:R562O:T60O:_ Snote: Ifield 'group' will be initialized after field 'stepSize'M PLE]/ N562C | C L _ S TtEiPdS(/tsiidz)e,o fn(tTh)r)e a{d s (| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa ds), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:c666k:(9t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea dIdx. 
x666) | , g r o u p ( gprroiumps)(,t i d| , ^~~~~~~~~~~ nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->upS,I MaPrLgEs,- >Ssuemn,d biunftf3,2 _atr)g s -| >^r ecvbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391 ^: 95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 391 | R202u | n W o r k < n c cRluFnuWnocr#k#Efluenmce,n tto,> (N)C.CrLu_nA(LwGeO)_;# # a| l ^g o, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppL:_4P:R1O:T Onote: _in instantiation of member function 'RunWork, 2, 2>::run' requested here# #pro t4o | >I(M)P.Lr_uCnO(L&Ln_cFcUlNSCh(mAelml.Rweodrukc)e;, \C O L| L ^N ET_DIRECT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562S:I15M:P Lnote: Efield 'nthreads' will be initialized after field 'tidInBlock', Sum, i562n | t 8 _ t )t i d| (^t id), 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t391h:r95e:a dnote: sexpanded from macro 'IMPL_COLL_FUNC'( nthrea d391s | ) , RtuindWIonrBkl60,: Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_ALG O562_ | # # a l gtoi,d (NtCiCdL)_,P RnOtThOr_e#a#dpsr(onttoh>r(e)a.drsu)n,( &tnicdcIlnSBhlmoecmk.(wtohrrke)a;d I\d x .| x ^) , gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:(15g:r onote: ufield 'nthreads' will be initialized after field 'tidInBlock'p ), | ^~~~~~~~~~~562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r15e:a dwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]d x.x), gr o562u | p ( g r otuipd)(,t i d| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h n:| t562 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h: r15e:a d563warning: s | initializer order does not match the declaration order [-Wreorder-ctor]( n t h rsetaedpsS)i ,z562 e | t( in dc Ic nltBSilhdom(cetkmi(.dtc)ho,rm emna.tdbhIurdfexfa.Sdxis)z(,en stg[hrNroCeuCapLd(_sgP)rR,oO uTtpOi)_d,SI In MB| Pl ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~Lo Ec 
]k| /( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)Nt ChCrL e_563aS | dT IE dP xS ./sxst)ie,zp eSgoirfzo(euT(p)n()cg cr{lo Su hp| m) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e, m . | c| group(groupo ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ m m .| b tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u ff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hS:i677 z:563e11 | s: [ Nnote: Cin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here C sLt_eP pR677SO | iT zO e_ (S nI cM cP lL SE h] m/peNrmCi.CmcLso_(mStmTi.EdbP-uStf/ifsdSiSiztzeaeorsft[(BNTcC)aC)sL t_{,P R nO| TT ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~hO r_ eS| aI group(groupdM sPBLcEa]s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/t:N,677C :C&11Ld:_i Srnote: Tein instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereEc PtS-/> so677iu | zt e, o fd (i Tr )e )c t {- >p dr| oi ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~wm ns ,(| t group(groupai rdg-st-i>dsSetnadrbtuBf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hcf:a,677s :ta11,r: g nsnote: T-in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh> rreeacdvsb Bu677cf | af s, t , | & ^ d i r e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h c:pt202r-:i>53mo:su (tnote: t,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei dd- it202ri | ed cS tt -a >r dt oB wc naR,su tna,Wr ognrsTk-hE>rlseeeamndedsnbBtucdrOepc,v bAulfgfo,, P| r ^o 
to>()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:u202n:(53w:e )note: ;in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp : 6 : 1 : Rnote: uin instantiation of member function 'RunWork, 2, 2>::run' requested heren Work E6l | eImMePnLt_E(T)_.DrIuRnE(CwTe,) ;S I M| P ^L E, Su/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppm:,5 :i1n:t 3note: 2in instantiation of member function 'RunWork, 2, 2>::run' requested here_ t) | 5^ | IMPL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:O391L:L95_:F Unote: Nexpanded from macro 'IMPL_COLL_FUNC'C (AllRed u391c | e , RCuOnLWLoNrEkT<_nDcIcRlEFCuTn,c #S#IfMuPnLcE,, tSyupme,, uFiunntc8#_#td)e v r| e^d op:, note: Nexpanded from macro 'IMPL_COLL_FUNC'C CL_ALG O391_ | # # aRlugnoW,o rNkCt(y)p.er,u nF(u&nncc#c#ldSehvmreemd.owpo\, N| C ^C L_AL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hG:O562_:#15#:a lnote: gfield 'nthreads' will be initialized after field 'tidInBlock'o , NCC L562_ | P R O T Ot_i#d#(ptriodt)o,> (n)t.hrruena(d&sn(cnctlhSrhemaedms.)w,o rtki)d;I n\B l o| c ^k (th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15I:d xnote: .field 'nthreads' will be initialized after field 'tidInBlock'x ), gr o562u | p ( g r otuipd)(,t i d| ) ^~~~~~~~~~~~~~~~~, n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e60a:d snote: (field 'group' will be initialized after field 'stepSize'n thre a562d | s ) , ttiiddI(ntBildo)c,k (ntthhrreeaaddIsd(xn.txh)r,e agdrso)u,p (tgirdoIunpB)l,o c k| ( ^~~~~~~~~~~~~~~~~t 
hr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:I60d:x .note: xfield 'group' will be initialized after field 'stepSize') , gro u562p | ( g r o utpi)d,( t i| d ^~~~~~~~~~~) , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ irect->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:5:1::562 :note: 15in instantiation of member function 'RunWork, 2, 2>::run' requested here: warning: initializer order does not match the declaration order [-Wreorder-ctor] 5 | IMPL_CO L562L | _ F U N Ct(iAdl(ltRiedd)u,c en,t hCrOeLaLdNsE(Tn_tDhIrReEaCdTs,) ,S ItMiPdLIEn,B lSoucmk,( tuhirneta8d_Itd)x . 
x| )^, group(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391u:p95):, note: expanded from macro 'IMPL_COLL_FUNC'| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 391 | 563R | u n W o rskt_,S INMCPCLLE_]A/LNGCOC_L#_#SaTlEgPoS,/ sNiCzCeLo_fP(RTO)T)O _{# # p| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o t o| > group(group( ).run(&ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hw:o677r:k11):; note: \in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 677 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : note: pfield 'nthreads' will be initialized after field 'tidInBlock'r ims(ti d562- | t i d S ttairdt(Btciads)t,, nntThhrreeaaddss(Bnctahsrte,a d&sd)i,r etcitd-I>noBulto,c kd(itrherceta-d>Iddoxw.nx,) ,a rggrso-u>ps(egnrdobuupf)f,, a| r ^~~~~~~~~~~~~~~~~g s->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562v:b60u:f fnote: ,field 'group' will be initialized after field 'stepSize' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:(53t:i dnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, nth r202e | a d s ( n t h r eRaudnsW)o,r ktEildeImneBnltou(p)).,r u n| ( ^~~~~~~~~~~w e); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562(:n15t:h rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]a ds), tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~i dInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:o562c:k60(:t hnote: rfield 'group' will be initialized after field 'stepSize'e adIdx. 
x562) | , g r otuipd((gtriodu)p,) ,n t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( nthre a563d | s ) , tsitdeIpnSBilzoec(kn(threcacdlISdhxm.exm).,c ogmrmo.ubpu(fgfrSoiuzpe)s,[ N C| C ^~~~~~~~~~~L _PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hwarning: :initializer order does not match the declaration order [-Wreorder-ctor]562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhrreeaaddss)(,n tthirdeIandBsl)o,c kt(itdhIrneBaldoIcdkx(.txh)r,e agdrIoduxp.(xg)r,o ugpr)o,u p (| g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o u| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~563 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) step S563i | z e ( n csctleSphSmiezme.(cnocmcml.SbhumfefmS.iczoemsm[.NbCuCfLf_SPiRzOeTsO[_NSCICMLP_LPER]O/TNOC_CSLI_MSPTLEEP]S//NsCiCzLe_oSfT(ETP)S)/ s{i z e| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f ( T| ) group(group) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 655 group(group: 11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :655641 | : 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here pr i641m | s ( t i d - t i d S tparritmRse(dtuicde-,t indTShtraeratdRseRdeudcuec,e ,n TnhurlelapdtsrR,e d&udcier,e cdti-r>eocutt-,> daorwgns,- >&sdeinrdebcutf-f>,o uatr,g sa-r>grse-c>vsbeunfdfb,u f f| , ^ args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e202c:v53b:u 
fnote: fin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, | ^202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 :R53u:n Wnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer kEle m202e | n t().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_AL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hGO_#:#562a:l15g:o ,warning: initializer order does not match the declaration order [-Wreorder-ctor]N CCL_PROTO_ #562# | p r o t ot>i(d)(.triudn)(,& nnctchlrSehamdesm(.nwtohrrke)a;d s\) , | t ^i dInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:k562(:t15h:r enote: a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hfield 'nthreads' will be initialized after field 'tidInBlock'd: I562d:x 15.562:x | ) warning: , initializer order does not match the declaration order [-Wreorder-ctor] g 
rtoiudp((tg ir562do | )u ,p ) n, t th ir| de ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~(a td is| (d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n) t,h r ne563ta | hd rs e) a, d sstt(iendptIShnirBzeleoa(cdnksc()ct,lh SrtheimdaeIdmnI.dBclxoo.mcxmk).(,bt uhgfrrfeoSauidpzI(edgsxr[.oNxuC)pC,)L ,_g Pr Ro| Ou ^~~~~~~~~~~~~~~~~Tp O(_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hgS:rI562oM:uP60pL):E, ]note: / field 'group' will be initialized after field 'stepSize'N| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C L _562| S | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T E P S / ts563ii | dz (e to if d(s)Tt,)e )np tS{hi rz ee| a( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dn sc (c| nl group(grouptS hhrmeeamd.sc)o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,m: m677t.:bi11ud:fI fnnote: SBin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereil zoecs k[677(N | tC Ch Lr e_ aP dR IO dT xO ._ SxpI)rM,iP mLgsEr(]otu/ipNd(C-gCtrLio_duSSpTt)Ea,Pr St/ Bs| ci ^~~~~~~~~~~a zse/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hto:,562f :(n15TT:)h )rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]{a d s| B ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c a s562| t | group(group, & d itriedc(tt-i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h>d:o), n626:9: utnote: thin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here,r edaidrse( nc626tth | -r >e d ao dws n) ,, atprrigidsIm-ns>B(slteiondcd-kb(tuitfdhfSr,te aaardrtIgSdscx-a.>txrt)ee,cr 
v,gb runofTufhp,r( eg ar| do ^su Spc)a,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht :t 202e| :r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~53, : | Nnote: tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)Uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here L L, d563202i | | r e c t -s >t ue pp S,Ri uzanerW(gonsrc-kc>ElslSeehnmdmebenumtf.eSrdieOzcpve,bs u[AfNlfCg,Co L,_ P| PR ^r OoTtOo_>S(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI):M.202Pr:Lu53En:]( /wnote: Nein instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereC) C;L _ S | T202 ^E | P S / /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpps :i 5 z: e1 o:Rf u(note: Tnin instantiation of member function 'RunWork, 2, 2>::run' requested here)W o)r k5{E | lI eMm| Pe ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Ln _t C<| OF group(groupLn L,_ FTU,N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h C:R(655eA:dl11Ol:pR ,enote: din instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereAu lcgeo, , 655 C | PO rL oL tN oE >T (_ )D .I rR uEpnCr(Tiw,me s)S(;It Mi Pd| L- ^Et ,i dSSutma,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppr :tu6Ri:en1dt:u8 c_note: etin instantiation of member function 'RunWork, 2, 2>::run' requested here,) n T| h^6r | eIaMd/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hPs:LR391_e:Cd95Ou:Lc Lenote: _,expanded from macro 'IMPL_COLL_FUNC'F UnNuCl( lA391pl | tl r ,R u&ndWiorrekccoluFtu,n 
ca#r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#gfsu-n>c:s,562e :nt15d:yb puwarning: ef,initializer order does not match the declaration order [-Wreorder-ctor]f ,F uanrcg#s#-d>erverc ev562db | ou p f< ft ,yt pi ed| >( ^,t iNdC)C,L _n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hAt:Lh202Gr:Oe53_a:#d #snote: a(in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereln gtoh,r e202Na | Cd Cs L) _, P Rt Oi Td OIR_nu#Bn#lWpoorcrokkt(Eotl>he(rm)ee.anrdtuI| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) .r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:n562( :w56315e | :) ; note: field 'nthreads' will be initialized after field 'tidInBlock' | s ^t ep S562i | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppz :e 6( :n 1ct:ci ldnote: S(in instantiation of member function 'RunWork, 2, 2>::run' requested hereht miedm) .,6c | onImtMmhP.rLbe_uaCfdOfsL(SLni_tzFheUrsNe[CaN(dCAsCl)Ll,_R PetRdiOudTcIOen_,BSl IoCMcOPkLL(LEtN]hE/rTNe_CaDCdILIR_dESxCT.TEx,P) S,S/ IsgMirPzoLeuEop,f( (gSTru)om)u, p {)i ,n t| 3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| 2 ^~~~~~~~~~~~~~~~~_ t| ) group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562^: 60: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hfield 'group' will be initialized after field 'stepSize': 391:95: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h note: :562expanded from macro 'IMPL_COLL_FUNC'655 | : 11 : note: 391tin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | i d (Rtuin dW655)o, | r kn ,R, e 
gdNruCocCueLp,_( AgnLruGolOul_pp#)t#,ra ,l g| &o ^~~~~~~~~~~d, i rNeCcCtL-_>PoRuOtT,O _a#r#gpsr-o>tsoe>n(d)b.urfufn,( &anrcgcsl-S>hrmeecmv.bwuofrfk,) ; | \ ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562202::1553:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562202 | | t i d ( tRiudn)W,o rnktEhlreemaednst(().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(tid:)562,: 15n:t hwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e ads(nthrea d562s | ) , t itdiIdn(Btliodc)k,( tnhtrheraedaIddsx(.nxt)h,r egardosu)p,( gtrioduIpn)B,l o c| k ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e adId x563. 
| x ) , gsrtoeuppS(igzreo(unpc)c,l S h| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e m .| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o mm.b u563f | f S i z esst[eNpCSCiLz_eP(RnOcTcOl_SShImMePmL.Ec]o/mNmC.CbLu_fSfTSEiPzSe/ss[iNzCeCoLf_(PTR)O)T O{_ S I| M ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P L E| ] group(group/ NCCL_ST/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hE:P666S:/9s:i znote: ein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo f(T) )666 | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupp rims(tid, nT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r666e:a9d:s Gnote: ain instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret her, d i666r | e c t - > u p , pNrUiLmLs,( tairdg,s -n>TshernedabdusfGfa,t haerrg,s -d>irreeccvtb-u>fufp,, N| U ^L L, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:s202-:>53s:e nnote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereb uff, 202a | r g s - > r e c vRbuunfWfo,r k E| l ^e ment, 2, 2>::run' requested herep , Al g202o | , P r o t o > (R)u.nrWuonr(kwEel)e;m e n| t ^< Fn, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppT:,6 :R1e:d Onote: pin instantiation of member function 'RunWork, 2, 2>::run' requested here, Al g6o | ,I MPPrLo_tCoO>L(L)_.FrUuNnC((wAel)l;R e d| u ^c e, COLLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppE:T5_:D1I:R Enote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested hereT , SI M5P | LIEM,P 
LS_uCmO,L Li_nFtU3N2C_(tA)l l R| e^d uce/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391C:O95L:L Nnote: Eexpanded from macro 'IMPL_COLL_FUNC'T _DIRE C391T | , SRIuMnPWLoEr,k , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L15_:A Lwarning: Ginitializer order does not match the declaration order [-Wreorder-ctor]O _##algo, NC C562L | _ P R O TtOi_d#(#tpirdo)t,o >n(t)h.rreuand(s&(nnctchlrSehamdesm).,w otrikd)I;n B\l o c| k ^( threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562d:x15.:x )note: ,field 'nthreads' will be initialized after field 'tidInBlock' grou p562( | g r o u pt)i,d ( t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n thread s563( | n t h r esatdesp)S,i ztei(dnIcncBllSohcmke(mt.hcroemamd.Ibduxf.fxS)i,z egsr[oNuCpC(Lg_rPoRuOpT)O,_ S I| M ^~~~~~~~~~~~~~~~~P LE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/:N562C:C60L:_ Snote: Tfield 'group' will be initialized after field 'stepSize'E PS/s i562z | e o f ( Tt)i)d ({t i d| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, n| t group(grouph reads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 666t:i9d:I nnote: Bin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel ock( t666h | r e a d I d x . 
xp)r,i mgsr(otuipd(,g rnoTuhpr)e,a d s| G ^~~~~~~~~~~a ther, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eRadIdexd.uxc)e,, gCrOoLuLpN(EgTr_oDuIpR)E,C T ,| ^~~~~~~~~~~~~~~~~S IMPLE, Sum/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562i:n60t:3 2note: _field 'group' will be initialized after field 'stepSize't ) | ^ 562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:(391t:i95d:) ,note: expanded from macro 'IMPL_COLL_FUNC'n threads (391n | t h 
rReuandWso)r,k , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ redop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9s:[ Nnote: Cin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereC L_PROT O666_ | S I M P L E ] / NpCrCiLm_sS(TtEiPdS,/ sniTzheroefa(dTs)G)a t{h e r| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ d i| r group(groupe ct->up, NULL, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:-677>:s11e:n dnote: bin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu ff, ar g677s | - > r e c v b u f f ,p r i| m ^s (tid-tidSt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:r202t:B53c:a snote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, nTh r202e | a 
d s B c a s t ,R u&ndWiorrekcEtl-e>moeuntt,< Fdni,r eTc,t -R>eddoOwpn,, Aalrggos,- >Psreontdob>u(f)f.,r uanr(gwse-)>;r e c| v ^b uff, | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp ^: 6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 2026: | 53I:M Pnote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here_ COLL_ F202U | N C ( A l l R e dRuucneW,o rCkOELlLeNmEeTn_tDt()) . r| u^n (we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h;: 391 :| 95 ^: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp: 6391: | 1 : Rnote: uin instantiation of member function 'RunWork, 2, 2>::run' requested heren Work <6n | cIcMlPFLu_nCcO#L#Lf_uFnUcN,C (tAylpleR,e dFuucnec,# #CdOeLvLrNeEdTo_pD,, SNICMCPLL_EA,L GSOu_m#,# ailngto3,2 _NtC)C L _| P^R OTO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h#:p391r:o95t:o >note: (expanded from macro 'IMPL_COLL_FUNC') .run(& n391c | c l SRhumneWmo.rwkod,) ,N CnCtLh_rAeLaGdOs_(#n#tahlrgeoa,d sN)C,C Lt_iPdRIOnTBOl_o#c#kp(rtohtroe>a(d)I.drxu.nx()&,n cgcrloSuhpm(egmr.owuopr)k,) ; | \ ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^: 562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 15562: | note: field 'nthreads' will be initialized after field 'tidInBlock' tid(t i562d | ) , n tthirde(atdisd()n,t hnrtehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldox.x), gcrko(utph(rgeraoduIpd)x,. 
x )| , ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d s(nthrea d562s | ) , t itdiIdn(Btliodc)k,(threa dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~B l o| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)k (thre a563d | I d x . xs)t,e pgSriozuep((ngcrcoluSph)m,e m .| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o m m| . 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)b uffSi z563e | s [ N C CsLt_ePpRSOiTzOe_(SnIcMcPlLSEh]m/eNmC.CcLo_mSmT.EbPuSf/fsSiizzeeosf[(NTC)C)L _{P R O| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O _ S| I group(groupM PLE]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hS:T655E:P11S:/ snote: iin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herez eof(T) )655 | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group prims(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:-677t:i11d:S tnote: ain instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer tRedu c677e | , n T h r e a d s Rperdiumcse(,t indu-ltlipdtSrt,a r&tdBicraesctt,- >noTuhtr,e aadrsgBsc-a>sste,n d&bduifrfe,c ta-r>gosu-t>,r edcivrbeucftf-,> d o| w ^n , args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:e202n:d53b:u fnote: fin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, arg s202- | > r e c v b u f fR,u n W| o ^r kElemen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:<202F:n53,: Tnote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here RedO p202, | A l g o , P rRoutnoW>o(r)k.Erluenm(ewnet)<;F n ,| ^T , RedO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppp:,7 :A1l:g onote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here Prot o7> | (I)M.PrLu_nC(OwLeL)_;F U N| C ^( AllRedu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppc:e5,: 1C:O Lnote: Lin 
instantiation of member function 'RunWork, 2, 2>::run' requested hereN ET_D I5R | EICMTP,L _SCIOMLPLL_EF,U NSCu(mA,l luRiendtu3c2e_,t )C O L| L^N ET_DI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:E391C:T95,: Snote: Iexpanded from macro 'IMPL_COLL_FUNC'M PLE, S u391m | , uRiunntW8o_rtk)< n c| c^l Func/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h#:#391f:u95n:c ,note: expanded from macro 'IMPL_COLL_FUNC't ype, F u391n | c # #RduenvWroerdkon,c #N#CfCuLn_cA,L GtOy_p#e#,a lFguon,c #N#CdCeLv_rPeRdOoTpO<_t#y#pper>o,t oN>C(C)L._rAuLnG(O&_n#c#callSghom,e mN.CwCoLr_kP)R;O T\O _ #| # ^p roto>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:)562.:r15u:n (note: &field 'nthreads' will be initialized after field 'tidInBlock'n cclSh m562e | m . w o rtki)d;( t\i d )| , ^ nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:(15n:t hnote: rfield 'nthreads' will be initialized after field 'tidInBlock'e ads), 562t | i d I n Btliodc(kt(itdh)r,e andtIhdrxe.axd)s,( ngtrhoruepa(dgsr)o,u pt)i,d I n| B ^~~~~~~~~~~~~~~~~l ock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r60e:a dnote: Ifield 'group' will be initialized after field 'stepSize'd x.x) ,562 | g r o u pt(igdr(otuipd)),, n| t ^~~~~~~~~~~~~~~~~h rea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562(:n60t:h rnote: efield 'group' will be initialized after field 'stepSize'a ds), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~B lock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hU:L562L:,15 :d 
iwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]e ct->up, a r562g | s - > s etniddb(utfifd,) ,a rngtsh-r>eraedcsv(bnutfhfr,e a d| s ^) , ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:I202n:B53l:o cnote: kin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here( thr e202a | d I d x . x ) , RgurnoWuopr(kgErloeumpe)n,t < F| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, T| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) RedOp ,563 | A l g o ,s tPerpoStioz>e(()n.crculnS(hwmee)m;. c o| m ^m .buff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppS:i6z:e1s:[ Nnote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested hereC L_PR O6T | OI_MSPILM_PCLOEL]L/_NFCUCNLC_(SATlElPRSe/dsuiczee,o fC(OTL)L)N E{T _ D| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R E C| T group(group, SIMPLE, Sum,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :i641n:t113:2 _note: tin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) | ^ 641/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 : 95 : note: expanded from macro 'IMPL_COLL_FUNC' prim s391( | t i dR-utniWdoSrtkaddoopwi,r eNcCtC-L>_oAuLtG,O _a#r#gasl-g>os,e nNdCbCuLf_fP,R OaTrOg_s#-#>prreoctvob>u(f)f.,r u n| ( ^& ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hw:o202r:k53):; note: \in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock'R unWo r562k | E 
l e m etnitd<(Ftni,d )T,, nRtehdrOepa,d sA(lngtoh,r ePardost)o,> (t)i.drIunnB(lwoec)k;( t h| r ^e adIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppx:)7,: 1g:r onote: uin instantiation of member function 'RunWork, 2, 2>::run' requested herep ( g7r | oIuMpP)L,_ C O| L ^~~~~~~~~~~~~~~~~L _FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:(562A:l60l:R enote: dfield 'group' will be initialized after field 'stepSize'u ce, CO L562L | N E T _ DtIiRdE(CtTi,d )S,I MnPtLhEr,e aSdusm(,n tuhirneta3d2s_)t,) t i| d^I nBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:k391(:t95h:r enote: aexpanded from macro 'IMPL_COLL_FUNC'd Idx.x )391, | g rRouunpW(ogrrko, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf,: 562 :| 15 ^: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 202562: | 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret id(ti d202) | , n t h r e a dRsu(nnWtohrrkeEaldesm)e,n tto(u)p.(rgurno(uwpe)),; | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp563: | 5 : 1 : snote: tin instantiation of member function 'RunWork, 2, 2>::run' requested heree pSize (5n | cIcMlPSLh_mCeOmL.Lc_oFmUmN.Cb(uAflflSRiezdeusc[eN,C CCLO_LPLRNOETTO__DSIIRMEPCLTE,] /SNICMCPLL_ES,T ESPuSm/,s iuzienotf8(_Tt))) {| ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hexpanded from macro 'IMPL_COLL_FUNC': 655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 391 | R u655n | W o r k < n c c l F upnrci#m#sf(utnid-tidcS,t atrytpRee,d uFcuen,c #n#TdherveraeddsoRpe ,n uNlClCpLt_rA,L G&Od_i#r#eacltg-o>,o uNtC,C La_rPgRsO-T>Os_e#n#dpbruoftfo,> (a)r.grsu-n>(r&enccvcbluSfhfm,e m .| w ^o rk); \/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202| : ^53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :20215 | : note: field 'nthreads' will be initialized after field 'tidInBlock' R u562n | W o r k Etliedm(etnitd<)F,n ,n tTh,r eRaeddsO(pn,t hArlegaod,s )P,r ottiod>I(n)B.lroucnk((wteh)r;e a d| I ^d x.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp :g7r:o1u:p (note: gin instantiation of member function 'RunWork, 2, 2>::run' requested herer oup )7, | I M| P ^~~~~~~~~~~~~~~~~L _CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:L562_:F60U:N Cnote: (field 
'group' will be initialized after field 'stepSize'A llRed u562c | e , C OtLiLdN(EtTi_dD)I,R EnCtTh,r eSaIdMsP(LnEt,h rSeuamd,s )u,i ntti3d2I_ntB)l o c| k^( thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d391I:d95x:. xnote: )expanded from macro 'IMPL_COLL_FUNC', group (391g | r o uRpu)n,W o r| k ^~~~~~~~~~~< ncclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::202:53:562 :note: 15in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: warning: initializer order does not match the declaration order [-Wreorder-ctor] 202 | 562 | R u ntWiodr(ktEilde)m,e nnttc(k)(.trhurne(awdeI)d;x . x| ) ^, group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppg:r6o:u1p:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 6 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | IMPL _563C | O L L _ FsUtNeCp(SAilzleR(endcuccleS,h mCeOmL.LcNoEmTm_.DbIuRfEfCSTi,z eSsI[MNPCLCEL,_ PSRuOmT,O _iSnItM3P2L_Et])/ N C| C^L _STEP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:/391s:i95z:e onote: fexpanded from macro 'IMPL_COLL_FUNC'( T)) { 391| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ R| u group(groupn Work, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren c, ty p666e | , F u n c # # dpervirmesd(otpih,r eNaCdCsLG_aAtLhGeOr_,# #dailrgeoc,t -N>CuCpL,_ PNRUOLTLO,_ #a#rpgrso-t>os>e(n)d.bruufnf(,& nacrcglsS-h>mreemc.vwbourfkf),; \| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::202562::5315:: note: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herefield 'nthreads' will be initialized after field 'tidInBlock' 202 | 562 | t i dR(utniWdo)r,k Enltehmreenatdr(e)a.drIudnx(.wxe)),; g r| o ^u 
p(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppu:p7):,1 : | note: ^~~~~~~~~~~~~~~~~in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 5627: | 60I:M Pnote: Lfield 'group' will be initialized after field 'stepSize'_ COLL_ F562U | N C ( A tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ llReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562391: | 15 : Rwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]n Workd,I nNBClCoLc_kA(LtGhOr_e#a#daIldgxo.,x )N,C CgLr_oPuRpO(TgOr_o#u#pp)r,o t o| > ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( ) .| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u n(&nc c563l | S h m e ms.tweoprSki)z;e (\n c c| l ^S hmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:o562m:m15.:b unote: ffield 'nthreads' will be initialized after field 'tidInBlock'f Size s562[ | N C C L _tPiRdO(TtOi_dS)I,M PnLtEh]r/eNaCdCsL(_nStThErPeSa/dssi)z,e otfi(dTI)n)B l{o c k| ( 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa dIdx.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hg:r666o:u9p:( gnote: rin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo up), 666| | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60 :p rnote: ifield 'group' will be initialized after field 'stepSize'm s(tid ,562 | n T h r etaidds(Gtaitdh)e,r ,n tdhirreeacdts-(>nutph,r eNaUdLsL),, atrigdsI-n>Bsleoncdkb(utfhfr,e aadrIgdsx-.>xr)e,c vgbruofufp,( g r| o ^u p), | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALG/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562#:#15a:l gwarning: oinitializer order does not match the declaration order [-Wreorder-ctor], NCCL_PRO T562O | _ # # p rtoitdo(>t(i)d.)r,u nn(t&hnrcecaldSsh(mnetmh.rweoardks));, \t i d| I ^n Bloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:(562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock'I dx.x) ,562 | g r o u pt(igdr(otuipd)),, n| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d s(nth r563e | a d s ) ,s tteipdSIinzBel(oncckc(ltShhrmeeamd.Icdoxm.mx.)b,u fgfrSoiuzpe(sg[rNoCuCpL)_,P R O| T ^~~~~~~~~~~~~~~~~O _S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:M562P:L60E:] /note: Nfield 'group' will be initialized after field 'stepSize'C CL_ST E562P | S / s i zteiodf((tTi)d)) ,{ n t| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| d group(groups (nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):,687 :t11i:d Inote: nin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereB lock(t h687r | e a d I d x . 
x ) , pgrriomusp((tgirdo-utpi)d,S t a| r ^~~~~~~~~~~t Bcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid)h, nthmreema.dwso(rnkt)h;r e\a d s| ) ^, tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d Inote: dfield 'nthreads' will be initialized after field 'tidInBlock'x .x), gr o562u | p ( g r otuipd)(,t i d| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, n| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h read s563( | n t h r esatdesp)S,i ztei(dnIcncBllSohcmke(mt.hcroemamd.Ibduxf.fxS)i,z egsr[oNuCpC(Lg_rPoRuOpT)O,_ S I| M ^~~~~~~~~~~~~~~~~P LE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/:N562C:C60L:_ Snote: Tfield 'group' will be initialized after field 'stepSize'E PS/si z562e | o f ( T )t)i d{( t i| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) , | n group(groupt 
hreads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 666t:i9d:I nnote: Bin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel ock(t h666r | e a d I d x . x )p,r igmrso(utpi(dg,r onuTph)r,e a d| s ^~~~~~~~~~~G ather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:f562(:T15):) warning: {initializer order does not match the declaration order [-Wreorder-ctor] | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d687):,11 :n tnote: hin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eads( n687t | h r e a d s ) , t ipdrIinmBsl(otcikd(-tthirdeSatdaIrdtxB.cxa)s,t ,g rnoTuhpr(egardosuBpc)a,s t ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~& d i| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ct->o u563t | , n u lsltpetprS,i zaer(gnsc-c>lsSehnmdebmu.fcfo,m ma.rbgusf-f>Sriezcevsb[uNfCfC,L _ P| R ^O TO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:L202E:]53/:N Cnote: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereL _STE P202S | / s i z e o f ( TR)u)n W{o r k| E ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l e m| e group(groupn t, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel go, Pr o687t | o > ( ) . 
r u n ( w ep)r;i m s| ( ^t id-tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppS:t7a:r1t:B cnote: ain instantiation of member function 'RunWork, 2, 2>::run' requested heres t, n T7h | rIeMaPdLs_BCcOaLsLt_,F U&NdCi(rAelcltR-e>douucte,, nCuOlLlLpNtErT,_ DaIrRgEsC-T>,s eSnIdMbPuLfEf,, Saurmg,s -u>irnetc3v2b_utf)f , | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h95::202 :note: 53expanded from macro 'IMPL_COLL_FUNC': note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 391 | 202 | R u n W o r k e(>),. rNuCnC(Lw_eA)L;G O _| # ^# algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppN:C8C:L1_:P Rnote: Oin instantiation of member function 'RunWork, 2, 2>::run' requested hereT O_## p8roto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStart/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562t:i15d:I nwarning: Binitializer order does not match the declaration order [-Wreorder-ctor]l ock(thread I562d | x . 
x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r ead s563) | , t i dsItneBplSoiczke((tnhcrcelaSdhImdexm..xc)o,m mg.rbouufpf(Sgirzes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdBxc.axs)t,, gnrTohurpe(agdrsoBucpa)s,t , | & ^~~~~~~~~~~~~~~~~d irec/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:-562>:o60u:t ,note: field 'group' will be initialized after field 'stepSize'n ullpt r562, | a r g st-i>ds(etniddb)u,f fn,t harregasd-s>(rnetchvrbeuafdfs,) , | t ^i dInBlock(threadIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,202 :g53r:o unote: pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here( group), 202 | | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:h562m:e15m:. 
cwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]m m.buffSiz e562s | [ N C C Lt_iPdR(OtTiOd_)S,I MnPtLhEr]e/aNdCsC(Ln_tShTrEePaSd/ss)i,z etoifd(ITn)B)l o{c k (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd Idx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:p641(:g11r:o unote: pin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 641 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | p r i m ss(tteipdS-itzied(SntcacrltSRhemdeumc.ec,o mnmT.hbruefafdSsiRzeedsu[cNeC,C Ld_iPrReOcTtO-_>SdIoMwPnL,E ]&/dNiCrCeLc_tS-T>EoPuSt/,s iazregosf-(>Ts)e)n d{b u f| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, a| r group(groupg s->recvbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 666 :| 9 ^: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 202666: | 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here p202r | i m s ( t i d , RnuTnhWroerakdEslGeamtehnetr<,F nd,i rTe,c tR-e>duOpp,, NAUlLgLo,, aPrrgost-o>>s(e)n.drbuunf(fw,e )a;r g s| - ^> recvb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppu:f7f:,1 : | note: ^in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:L202_:C53O:L Lnote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereF UNC(A l202l | R e d u c e , 
CROuLnLWNoErTk_EDlIeRmEeCnTt,< FSnI,M PTL,E ,R eSduOmp,, uAilngto3,2 _Ptr)o t o| >^( ).ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:(391w:e95):; note: expanded from macro 'IMPL_COLL_FUNC'| ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp :R6u:n1W:o rnote: kin instantiation of member function 'RunWork, 2, 2>::run' requested here< ncclF u6n | cI#M#PfLu_nCcO,L Lt_yFpUeN,C (FAulnlcR#e#dduecver,e dCoOpLD,I RNECCCTL,_ ASLIGMOP_L#E#,a lSguom,, NiCnCtL3_2P_RtO)T O _| #^# prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:>391(:)95.:r unote: nexpanded from macro 'IMPL_COLL_FUNC'( &ncclShm e391m | . w oRrukn)W;o r\k < n| c ^c lFun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:#562#:f15u:n cnote: ,field 'nthreads' will be initialized after field 'tidInBlock' type ,562 | F u n c #t#idde(vtriedd)o,p a,d sN(CnCtLh_rAeLaGdOs_)#,# atligdoI,n BNlCoCcLk_(PtRhOrTeOa_d#I#dpxr.oxt)o,> (g)r.oruupn((g&rnocucpl)S,hmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15I:n Bwarning: linitializer order does not match the declaration order [-Wreorder-ctor]o ck(threa d562I | d x . x )t,i dg(rtoiudp)(,g rnotuhpr)e,a d s| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ads), 563t | i d I n BsltoecpkS(itzher(enacdcIldSxh.mxe)m,. cgormomu.pb(ugfrfoSuipz)e,s [ N| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C L _| P tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)R OTO_S I563M | P L E ] /sNtCeCpLS_iSzTeE(PnSc/csliSzhemoefm(.Tc)o)m m{. 
b u| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f S i| z group(groupe s[NCCL_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hS:I641M:P11L:E ]note: /in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereN CCL_S T641E | P S / s i z e o f ( Tp)r)i m{s ( t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d - t| i group(groupd StartRe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:u655c:e11,: nnote: Tin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh reads R655e | d u c e , d i r e cptr-i>mdso(wtni,d -&tdiidrSetcatr-t>Roeudtu,c ea,r gnsT-h>rseeanddsbRuefdfu,c ea,r gnsu-l>lrpetcrv,b u&fdfi,r e c| t ^- >out, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:g202s:-53>:s enote: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered buff ,202 | a r g s - > r e cRvubnuWfofr,k E l| e ^m ent, 2, 2>::run' requested herep , Al g202o | , P r o t o > (R)u.nrWuonr(kwEel)e;m e n| t ^< Fn, T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp,: 8R:e1d:O pnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here Alg o8, | IPMrPoLt_oC>O(L)L._rFuUnN(Cw(eA)l;l R e| d ^u ce, CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppL:L8N:E1T:_ Dnote: Iin instantiation of member function 'RunWork, 2, 2>::run' requested hereR ECT, 8S | IIMMPPLLE_,C OSLuLm_,F UiNnCt(6A4l_ltR)e d u| c^e , CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:L391N:E95T:_ Dnote: Iexpanded from macro 'IMPL_COLL_FUNC'R ECT, S391I | M P LREu,n WSourmk,< nicnctl6F4u_ntc)# # f| u^n 
c, t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hy:p391e:,95 :F unote: nexpanded from macro 'IMPL_COLL_FUNC'c ##dev r391e | d o pRk,< nNcCcClLF_uAnLcG#O#_f#u#nacl,g ot,y pNeC,C LF_uPnRcO#T#Od_e#v#rperdootpo<>t(y)p.er>u,n (N&CnCcLc_lASLhGmOe_m#.#waolrgko),; N\C C L| _ ^P ROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:#562p:r15o:t onote: >field 'nthreads' will be initialized after field 'tidInBlock'( ).run (562& | n c c l Sthimde(mt.iwdo)r,k )n;t h\r e a| d ^s (nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:) ,note: field 'nthreads' will be initialized after field 'tidInBlock't idInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562B:l60o:c knote: (field 'group' will be initialized after field 'stepSize't hread I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~( nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d60s:) ,note: field 'group' will be initialized after field 'stepSize't idIn B562l | o c k ( tthirde(atdiIdd)x,. 
xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~t idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562):,15 :t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]I nBlock(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~r eads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :t60i:d Inote: nfield 'group' will be initialized after field 'stepSize'B lock( t562h | r e a d Itdixd.(xt)i,d )g,r onutph(rgeraodusp()n,t h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d s| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), tidI n563B | l o c k (sttherpeSaidzIed(xn.cxc)l,S hgmreomu.pc(ogmrmo.ubpu)f,f S i| z ^~~~~~~~~~~e s[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement((t)i.dr)u,n (nwteh)r;e a d| s ^( nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp :t9i:d1I:n Bnote: lin instantiation of member function 'RunWork, 2, 2>::run' requested hereo ck(thr e9a | dIIMdPxL._xC)O,L Lg_rFoUuNpC((gArloluRpe)d,u c e| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ C O| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L NET_ D563I | R E C T ,s tSeIpMSPiLzEe,( nScucml,S humienmt.6co4m_mt.)b u f| f^S izes[N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:C391L:_95P:R Onote: Texpanded from macro 'IMPL_COLL_FUNC'O _SIMPLE]/ N391C | C L _RSuTnEWPoSr/ks, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo p | , N C C L _ A L G Op_r#i#masl(gtoi,d -NtCiCdLS_tPaRrOtTROe_d#u#cper,o tnoT>h(r)e.ardusnR(e&dnucccel,S hnmuelml.pwtorr,k )&;d i\r e c| t ^- >out, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:g562s:-15>:s enote: nfield 'nthreads' will be 
initialized after field 'tidInBlock'd buff, a562r | g s - > rtid(teicdv)b,u fnft,h r e| a ^d s(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s202):,53 :t inote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereI nBl o202c | k ( t h r e a d IRduxn.Wxo)r,k Eglreomuepn(tg().r u562n | ( w e ) ;t i d| ( ^t id), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppn:t7h:r1e:a dnote: sin instantiation of member function 'RunWork, 2, 2>::run' requested here( nthr e7a | dIsM)P,L _tCiOdLILn_BFlUoNcCk((AtlhlrReeadduIcdex,. xC)O,L LgNrEoTu_pD(IgRrEoCuTp,) ,S I M| P ^~~~~~~~~~~L E, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:iz562e:(15n:c cwarning: linitializer order does not match the declaration order [-Wreorder-ctor]S hmem.comm.bu f562f | S i z e st[iNdC(CtLi_dP)R,O TnOt_hSrIeMaPdLsE(]n/tNhCrCeLa_dSsT)E,P St/isdiIzneBolfo(cTk)()t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~I d x| . 
group(groupx ), group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:u655p:)11,: note: | in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 655 | 563 | sptreipmSsi(ztei(dn-ctcildSShtmaermt.Rceodmumc.eb,u fnfTShirzeeasd[sNRCeCdLu_cPeR,O TnOu_lSlIpMtPrL,E ]&/dNiCrCeLc_tS-T>EoPuSt/,s iazregosf-(>Ts)e)n d{b u f| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, a| r group(groupg s->recvb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:f641f:,11 : | note: ^in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 202641: | 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202p | r i m s ( t i d -RtuindWSotrakrEtlReemdeuncte<,F nn,T hTr,e aRdesdROepd,u cAel,g od,i rPercott-o>>d(o)w.nr,u n&(dwier)e;c t -| > ^o ut, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpps:-7>:s1e:n dnote: bin instantiation of member function 'RunWork, 2, 2>::run' requested hereu ff, a7r | gIsM-P>Lr_eCcOvLbLu_fFfU,N C (| A ^l lReduce/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202C:O53L:L Nnote: Ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereT _DIR E202C | T , S I M P L ER,u nSWuomr,k Eulienmte3n2t_ | ( ) .RruunnW(owrek)<;n c c| l ^F unc##f/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppu:n8c:,1 :t ynote: pin instantiation of member function 
'RunWork, 2, 2>::run' requested heree , Fu n8c | #I#MdPeLv_rCeOdLoLp_A,l lNRCeCdLu_cAeL,G OC_O#L#LaNlEgTo_,D INRCECCLT_,P RSOITMOP_L#E#,p rSoutmo,> (i)n.tr6u4n_(t&)n c c| l^S hmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:w391o:r95k:) ;note: expanded from macro 'IMPL_COLL_FUNC'\ | ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :R562u:n15W:o rnote: kfield 'nthreads' will be initialized after field 'tidInBlock'< ncclF u562n | c # # f utnicd,( ttiydp)e,, nFtuhnrce#a#ddse(vnrtehdroepat,i dNICnCBLl_oAcLkG(OthreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ id), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:: 666warning: :initializer order does not match the declaration order [-Wreorder-ctor]9 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 666 | t i d ( t ipdr)i,m sn(tthirde,a dnsT(hnrtehardesaGdast)h,e rt,i ddIinrBelcotc-k>(utph,r eNaUdLILd,x .axr)g,s -g>rsoeunpd(bgurfofu,p )a,r g s| - ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~> r e| c tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)v buff ,563 | | ^ stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hz:e202(:n53c:c lnote: Sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereh mem. c202o | m m . b u f f S iRzuensW[oNrCkCELl_ePmReOnTtO<_FSnI,M PTL,E ]R/eNdCOCpL,_ SATlEgPoS,/ sPirzoetoof>((T)).)r u{n ( w| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) ; | group(group| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h8::6661::9 :note: in instantiation of member function 'RunWork, 2, 2>::run' requested herenote: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 8 | 666I | M P L _ C O L L _pFrUiNmCs((AtlildR,e dnuTcher,e aCdOsLGLaNtEhTe_rD,I RdEiCrTe,c tS-I>MuPpL,E ,N USLuLm,, airngts6-4>_ste)n d b| u^f f, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:s391-:>95r:e cnote: vexpanded from macro 'IMPL_COLL_FUNC'b uff, 391| | ^ RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:k202<:n53c:c lnote: Fin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu nc## f202u | n c , t y p e ,R uFnuWnocr#k#Edleevmreendto ,R eNdCOCpL,_ AALlGgOo_,# #Parlogtoo,> (N)C.CrLu_nP(RwOeT)O;_ # #| p ^r oto>()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppr:u7n:(1&:n cnote: cin instantiation of member function 'RunWork, 2, 2>::run' requested herel Shmem .7w | oIrMkP)L;_ C\O L L| _ ^F UNC(A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:l562R:e15d:u cnote: efield 'nthreads' will be initialized after field 
'tidInBlock', COL L562N | E T _ D ItRiEdC(Tt,i dS)I,M PnLtEh,r eSaudms,( nutihnrte3a2d_st)), t| i^d InBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:k391(:t95h:r enote: aexpanded from macro 'IMPL_COLL_FUNC'd Idx.x) ,391 | g r oRuupn(Wgorrokuh,r eNaCdCsL(_nAtLhGrOe_a#d#sa)l,g ot,i dNICnCBLl_oPcRkO(TtOh_r#e#apdrIodtxo.>x()),. rgurno(u&pn(cgcrloSuhpm)e,m . w| o ^~~~~~~~~~~r k); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize'_ 562 | # # a l gtoi,d (NtCiCdL)_,P RnOtThOr_e#a#dpsr(onttoh>r(e)a.drsu)n,( &tnicdcIlnSBhlmoecmk.(wtohrrke)a;d I\d x .| x ^) , group(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^~~~~~~~~~~15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereC L_PROT O677_ | S I M P L E ] / N C CpLr_iSmTsE(PtSi/ds-itziedoSft(aTr)t)B c{a s t| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ n T| h group(groupr eadsBcast,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :&626d:i9r:e cnote: tin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here- >out, 626d | i r e c t - > d opwrni,m sa(rtgisd-->tsiednSdtbaurftfS,c aatrtgesr-,> rneTchvrbeuafdfs,S c a| t ^t er, NULL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202d:i53r:e cnote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here- >up, a202r | g s - > s e n d bRuufnfW,o rakrEglse-m>ernetc, 2, 2>::run' requested heret o>( )202. 
| r u n ( w e ) ; R u| n ^W orkElemen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppt:<7F:n1,: Tnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here RedO p7, | IAMlPgLo_,C OPLrLo_tFoU>N(C)(.ArlulnR(ewdeu)c;e , | C ^O LLNET/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp_:D7I:R1E:C Tnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here SIM P7L | EI,M PSLu_mC,O LuLi_nFtU3N2C_(tA)l l R| e^d uce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:O391L:L95N:E Tnote: _expanded from macro 'IMPL_COLL_FUNC'D IRECT, S391I | M P LREu,n WSourmk,< nucicnltF3u2n_ct#)# f u| n^c , ty/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:e391,: 95F:u nnote: cexpanded from macro 'IMPL_COLL_FUNC'# #devre d391o | p < tRyupneW>o,r kNp(<)t.yrpuen>(,& nNcCcClLS_hAmLeGmO._w#o#rakl)g;o ,\ N C| C ^L _PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:O562_:#15#:p rnote: ofield 'nthreads' will be initialized after field 'tidInBlock't o>() .562r | u n ( & ntcicdl(Sthimde)m,. 
wnotrhkr)e;a d\s ( n| t ^h reads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562t:i15d:I nnote: Bfield 'nthreads' will be initialized after field 'tidInBlock'l ock(th r562e | a d I d xt.ixd)(,t igdr)o,u pn(tghrroeuapd)s,( n t| h ^~~~~~~~~~~~~~~~~r ead/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 60t:i dnote: Ifield 'group' will be initialized after field 'stepSize'n Bloc k562( | t h r e atdiIdd(xt.ixd)),, gnrtohurpe(agdrso(unpt)h,r e a| d ^~~~~~~~~~~~~~~~~s ), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562I:n60B:l onote: cfield 'group' will be initialized after field 'stepSize'k (thre a562d | I d x . xtid(ti)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~d s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:)15,: nwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]h reads(nth r562e | a d s ) ,t itdi(dtIindB)l,o cnkt(htrheraedasd(Indtxh.rxe)a,d sg)r,o utpi(dgIrnoBulpo)c,k ( t| h ^~~~~~~~~~~r eadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:w562o:r15k:) ;warning: initializer order does not match the declaration order [-Wreorder-ctor]\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 15 : tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd (tid )562, | n t h rteiadd(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u p (| g tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r oup) ,563 | | ^~~~~~~~~~~~~~~~~ st/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:p562S:i60z:e (note: nfield 'group' will be initialized after field 'stepSize'c clShm e562m | . 
c o m mt.ibdu(ftfiSdi)z,e sn[tNhCrCeLa_dPsR(OnTtOh_rSeIaMdPsL)E,] /tNiCdCILn_BSlToEcPkS(/tshirzeeaodfI(dTx).)x ){, g| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o u p| ( group(groupg roup), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^~~~~~~~~~~: 687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 
0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:L562_:C15O:L Lwarning: _initializer order does not match the declaration order [-Wreorder-ctor]F UNC(AllR e562d | u c e , tCiOdL(LtNiEdT)_,D InRtEhCrTe,a dSsI(MnPtLhEr,e aSdusm),, utiindtI6n4B_lto)c k (| t^h 
readI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:x391:.95x:) ,note: expanded from macro 'IMPL_COLL_FUNC'g roup(g r391o | u p )R,u n W| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r k <| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)c clFun c563# | # f u n cs,t etpySpiez,e (Fnucnccl#S#hdmeevmr.ecdoompm<.tbyupfef>S,i zNeCsC[LN_CACLLG_OP_R#O#TaOl_gSoI,M PNLCEC]L/_NPCRCOLT_OS_T#E#PpSr/ostioz>e(o)f.(rTu)n)( &{n c c| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S h m| e group(groupm .work); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 626 ^: 9: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock'626 | 562 | p r itmisd((ttiidd-)t,i dnSttharretaSdcsa(tnttehrr,e andTsh)r,e atdisdSIcnaBtltoecrk,( tNhUrLeLa,d Iddixr.exc)t,- >gurpo,u pa(rggrso-u>ps)e,n d b| u ^~~~~~~~~~~~~~~~~f f, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:g562s:-60>:r enote: cfield 'group' will be initialized after field 'stepSize'v buff, 562 | | ^ tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d202):,53 :n tnote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer eads (202n | t h r e a d s ) ,R utniWdoIrnkBElloecmke(ntth (| ) ^~~~~~~~~~~. 
run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h, Su:m562,: 15u:i nwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]6 4_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562: | 391 : 95 : tnote: iexpanded from macro 'IMPL_COLL_FUNC'd (tid), n391t | h r eRaudnsW(onrtkhu,p )N,C C L| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~A L G| O tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ ##alg o563, | N C C Ls_tPeRpOSTiOz_e#(#npcrcoltSoh>m(e)m..rcuonm(m&.nbcucflfSShimzeems.[wNoCrCkL)_;P R\O T O| _ ^S IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:]562/:N15C:C Lnote: _field 'nthreads' will be initialized after field 'tidInBlock'S TEPS /562s | i z e o ft(iTd)()t i{d ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n t h| r group(groupe ads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 641t:i11d:I nnote: Bin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel ock(t h641r | e a d I d x . x ) , pgrriomusp((tgirdo-utpi)d,S t a| r ^~~~~~~~~~~~~~~~~t Red/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:c562e:,60 :n Tnote: hfield 'group' will be initialized after field 'stepSize'r eads R562e | d u c e ,t iddi(rteicdt)-,> dnotwhnr,e a&ddsi(rnetchtr-e>aodust),, atrigdsI-n>Bsleoncdkb(utfhfr,e aadrIgdsx-.>xr)e,c vgbruofufp,( g r| o ^u p), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recv/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] n(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ buff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562 | 202 | t i d ( t i dR)u,n WnotrhkrEelaedmse(nnttI(d)x..rxu)n,( wger)o;u p (| g ^r oup),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp : 10| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~1 : | note: tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | 563I | M P L _ CsOtLeLp_SFiUzNeC((nAclcllRSehdmuecme.,c oCmOmL.LbNuEfTf_SDiIzReEsC[TN,C CSLI_MPPRLOET,O _SSuImM,P LhEa]l/fN)C C L| _^S TEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/:s391i:z95e:o fnote: (expanded from macro 'IMPL_COLL_FUNC'T )) { 391| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ R| u group(groupn Work, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec , type ,655 | F u n c # # d e v r epdroipm-,t iNdCSCtLa_rAtLRGeOd_u#c#ea,l gnoT,h rNeCaCdLs_RPeRdOuTcOe_,# #npurloltpot>r(,) .&rduinr(e&cntc-c>loSuhtm,e ma.rwgosr-k>)s;e n\d b u| f ^f , ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:s562-:>15r:e cnote: vfield 'nthreads' will be initialized after field 'tidInBlock'b uff, 562| | ^ tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:)53,: nnote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereh reads (202n | t h r e a d s ) ,R utniWdoIrnkBElloecmke(ntth (| ) ^~~~~~~~~~~~~~~~~. 
run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:w562e:)60;: note: | field 'group' will be initialized after field 'stepSize' ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp : 9 :t1i:d (note: tin instantiation of member function 'RunWork, 2, 2>::run' requested herei d), n9t | hIrMePaLd_sC(OnLtLh_rFeUaNdCs()A,l ltRieddIuncBel,o cCkO(LtLhNrEeTa_dDIIdRxE.CxT),, SgIrMoPuLpE(,g rSouump,) ,u i n| t ^~~~~~~~~~~6 4_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorks,( nNtChCrLe_aAdLsG)O,_ #t#iadlIgnoB,l oNcCkC(Lt_hPrReOaTdOI_d#x#.pxr)o,t og>r(o)u.pr(ugnr(o&unpc)c,l S h| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e m 
.| w tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o rk); \ 563 | | ^ ste/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:S562i:z15e:( nnote: cfield 'nthreads' will be initialized after field 'tidInBlock'c lShme m562. | c o m m .tbiudf(ftSiidz)e,s [nNtChCrLe_aPdRsO(TnOt_hSrIeMaPdLsE)],/ NtCiCdLI_nSBTlEoPcSk/(stihzreeoafd(ITd)x). x{) , | g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group( group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : ^~~~~~~~~~~~~~~~~666 :9:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :note: 562in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 60: note: field 'group' will be initialized after field 'stepSize'666 | 562 | p r itmisd((ttiidd,) ,n TnhtrheraedasdGsa(tnhtehrr,e addisr)e,c tt-i>duIpn,B lNoUcLkL(,threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, 
direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation 
of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE, S:u562m:,15 :u iwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]t 64_t) | ^ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 : 95 :t inote: dexpanded from macro 'IMPL_COLL_FUNC'( tid), n t391h | r e aRdusn(Wnotrhkrp,) ,N C C| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ A L| G tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)O _##al g563o | , N C CsLt_ePpRSOiTzOe_(#n#cpcrloSthom>e(m)..croumnm(.&bnucfcflSSihzmeesm[.NwCoCrLk_)P;R O\T O _| S ^I 
MPLE]//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L15_:S Tnote: Efield 'nthreads' will be initialized after field 'tidInBlock'P S/siz e562o | f ( T ) )t i{d ( t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d ) ,| group(groupn threads(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:)677,: 11t:i dnote: Iin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren Block( t677h | r e a d I d x . x ) ,p rgirmosu(pt(igdr-otuipd)S,t a r| t ^~~~~~~~~~~~~~~~~B cast/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:T60h:r enote: afield 'group' will be initialized after field 'stepSize'd sBcas t562, | & d i rteicdt(-t>iodu)t,, ndtihrreecatd-s>(dnotwhnr,e aadrsg)s,- >tsiednIdnbBulfofc,k (atrhgrse-a>drIedcxv.bxu)f,f ,g r o| u ^p (group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202| : ^~~~~~~~~~~53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(All/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hRedu:c562e:,15 :C Owarning: Linitializer order does not match the declaration order [-Wreorder-ctor]L NET_DIRECT, S562I | M P L E ,t iSdu(mt,i du)i,n tn6t4h_rte)a d s| (^n thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:s95):, note: texpanded from macro 'IMPL_COLL_FUNC'i dInBl o391c | k ( tRhurneWaodrIkdS,i zNeC(CnLc_cAlLSGhOm_e#m#.aclogmom,. 
bNuCfCfLS_iPzReOsT[ON_C#C#Lp_rPoRtOoT>O(_)S.IrMuPnL(E&]n/cNcClCSLh_mSeTmE.PwSo/rski)z;e o\f ( T| ) ^) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~562 : 15| : group(group note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :t641i:d11(:t inote: din instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , nthr e641a | d s ( n t h r e a d sp)r,i mtsi(dtIindB-ltoicdkS(ttahrrteRaeddIudcxe.,x )n,T hgrreoaudps(Rgerdouucpe),, d i| r ^~~~~~~~~~~~~~~~~e ct-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>:d562o:w60n:, note: &field 'group' will be initialized after field 'stepSize'd irec t562- | > o u t ,t iadr(gtsi-d>)s,e nndtbhurfefa,d sa(rngtsh-r>eraedcsv)b,u ftfi,d I n| B ^l ock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d202I:d53x:. 
xnote: ) ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562 | : 562 :t15i:d warning: initializer order does not match the 
declaration order [-Wreorder-ctor] (tid), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ g r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u p(gro u563p | ) , | s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t e p| S tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i ze(n c563c | l S h m esmt.ecpoSmimz.eb(unfcfcSliSzhemse[mN.CcCoLm_mP.RbOuTfOf_SSiIzMePsL[EN]C/CNLC_CPLR_OSTTOE_PSSI/MsPiLzEe]o/fN(CTC)L)_ S{T E P| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/ s i| z group(groupe of(T)) {/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 655| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~11 : | note: group(groupin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 666 : 9 : pnote: rin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei ms(ti d666- | t i d S t a r t Rperdiumcse(,t indT,h rneTahdrseRaeddsuGcaet,h enru,l ldpitrre,c t&-d>iurpe,c tN-U>LoLu,t ,a ragrsg-s>-s>esnednbdubfuff,f ,a ragrsg-s>-r>ercevcbvubfuff,f , | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::202202::5353:: note: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herein instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202202 | | RRuunnWWoorrkkEElleemmeenntt<>(())..rruunn((wwee));; | | ^ ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp::119::11:: note: note: in instantiation of member function 'RunWork, 2, 2>::run' requested herein instantiation of member function 'RunWork, 2, 2>::run' requested here 119 | | IIMMPPLL__CCOOLLLL__FFUUNNCC((AAllllRReedduuccee,, CCOOLLLLNNEETT__DDIIRREECCTT,, SSIIMMPPLLEE,, SSuumm,, uint6f4l_ota)t ) | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h391::39195::95 :note: expanded from macro 'IMPL_COLL_FUNC'note: expanded from macro 'IMPL_COLL_FUNC' 391 | 391 | RunWo r kRp,< tNyCpCeL>_,A LNGCOC_L#_#AaLlGgOo_,# #NaClCgLo_,P RNOCTCOL__#P#RpOrToOt_o#>#(p)r.ortuon>((&)n.crculnS(h&mnecmc.lwSohrmke)m;. w\o r k| ) ^; \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :15: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hfield 'nthreads' will be initialized after field 'tidInBlock': 562:15: note: 562field 'nthreads' will be initialized after field 'tidInBlock' | t562i | d ( t i dt)i,d (nttihdr)e,a dnst(hnrtehardesa(dnst)h,r etaiddsI)n,B ltoicdkI(ntBhlroecakd(Itdhxr.exa)d,I dgxr.oxu)p,( ggrroouupp)(,g r o| u ^~~~~~~~~~~~~~~~~p ), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ^~~~~~~~~~~~~~~~~: 60: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :field 'group' will be initialized after field 'stepSize'562 :60: note: field 'group' will be initialized after field 'stepSize'562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o 
u| p ^~~~~~~~~~~) , | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->rec/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | 
prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ vbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 
2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ WorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h10::2021::53 :note: in instantiation of member function 'RunWork, 2, 2>::run' requested herenote: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 20210 | | I M P L _ C O LRLu_nFWUoNrCk(EAllelmReendtu,( )S.urmu,n (hwael)f;) | | ^^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:: 11note: :expanded from macro 'IMPL_COLL_FUNC'1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 39111 | | I MRPuLn_WCoOrLkL<_nFcUcNlCF(uAnlcl#R#efduuncce,, tCyOpLeL,N EFTu_nDcI#R#EdCeTv,r eSdIoMpPu,m ,N CfClLo_aAtL)G O _| #^# algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391N:C95C:L _note: Pexpanded from macro 'IMPL_COLL_FUNC'R OTO_## p391r | o t oR>u(n)W.orrukn<(n&cncclcFluSnhcm#e#mf.uwnocr,k )t;y p\e , | F ^u nc##devre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:o562p:<15t:y pnote: efield 'nthreads' will be initialized after field 'tidInBlock'> , NCCL _562A | L G O _ #t#iadl(gtoi,d )N,C CnLt_hPrReOaTdOs_(#n#tphrroetaod>s()),. rtuind(I&nnBclcolcSkh(mtehmr.ewaodrIkd)x;. 
x\) , | g ^r oup(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:)15,: note: | field 'nthreads' will be initialized after field 'tidInBlock' ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 60 : note: tfield 'group' will be initialized after field 'stepSize'i d(tid )562, | n t h rt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hei:ad562d(:st15(i:nd t)warning: h,initializer order does not match the declaration order [-Wreorder-ctor]r enatdhs r)562e, | a dt si (d nIttnihBdrl(eotacidkds())t,,h rntetiahddrIIendaBxdl.sox(c)nk,t( htgrhreroaeudapsd()Ig,dr xot.uixpd))I,,n Bg lr| oo ^~~~~~~~~~~~~~~~~cu kp((tg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hhr:ro562eu:ap60d):I,d xnote: .field 'group' will be initialized after field 'stepSize'| x ^~~~~~~~~~~) , g562r | o u p ( gtriodu(pt)i,d ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ads(n t563h | r e a d ss)t,e ptSiidzIen(BnlcocclkS(htmherme.acdoImdmx..bxu)f,f Sgirzoeusp[(NgCrCoLu_pP)R,O T O| _ ^~~~~~~~~~~S IMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ c, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of 
member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, 
direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation 
of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, CO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 
'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 15 :t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]( tid), nth r562e | a d s ( nttihdr(etaidds)),, nttihdrIenaBdlso(cnkt(htrheraedasd)I,d xt.ixd)I,n Bglroocukp((tghrroeuapd)I,d x .| x ^~~~~~~~~~~~~~~~~) , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562g:r60o:u pnote: (field 'group' will be initialized after field 'stepSize'g roup )562, | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( tid), n563t | h r e a dsst(enptShirzeea(dnsc)c,l SthimdeImn.Bcloomcmk.(btuhfrfeSaidzIedsx[.NxC)C,L _gPrRoOuTpO(_gSrIoMuPpL)E,] / N| C ^~~~~~~~~~~C L_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | ^ :562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hwarning: :initializer order does not match the declaration order [-Wreorder-ctor]391 :95: note: expanded from macro 'IMPL_COLL_FUNC' 562391 | | R utniWdo(rtkid,x .NxC)C,L _gArLoGuOp_(#g#raolugpo),, N C| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _ P| R tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)O TO_## p563r | o t o > (s)t.erpuSni(z&en(cncclcSlhSmhemme.mw.ocrokm)m;. b\u f f| S ^i zes[NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562P:R15O:T Onote: _field 'nthreads' will be initialized after field 'tidInBlock'S IMPL E562] | / N C C Lt_iSdT(EtPiSd/)s,i znetohfr(eTa)d)s ({n t h| r ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e a d| s group(group) , tidInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:I677d:x11.:x )note: ,in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here group( g677r | o u p ) , | ^~~~~~~~~~~~~~~~~ p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:i562m:s60(:t inote: dfield 'group' will be initialized after field 'stepSize'- tidS t562a | r t B c atsitd,( tniTdh)r,e andtshBrceaasdts,( n&tdhirreeacdts-)>,o utti,d IdniBrleocctk-(>tdhorwena,d Iadrxg.sx-)>,s egnrdobuupf(fg,r oaurpg)s,- > r| e ^~~~~~~~~~~c vbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95:: 562note: :expanded from macro 'IMPL_COLL_FUNC'15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 391 | RunWork <562n | c c l F utnicd#(#tfiudn)c,, nttyhpree,a dFsu(nnct#h#rdeeavdrse)d,o ptl,o cNkC(CtLh_rAeLaGdOI_d#x#.axl)g,o ,g rNoCuCpL(_gPrRoOuTpO)_,# # p| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o t o| > tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( ).ru n563( | & n c c lsSthempeSmi.zweo(rnkc)c;l S\h m e| m ^. comm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:b562u:f15f:S inote: zfield 'nthreads' will be initialized after field 'tidInBlock'e s[NCC L562_ | P R O T Ot_iSdI(MtPiLdE)],/ NnCtChLr_eSaTdEsP(Sn/tshirzeeaodfs()T,) )t i{d I n| B ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l o c| k group(group( threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:u666p:(9g:r onote: uin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep ), | ^~~~~~~~~~~~~~~~~666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60 : note: pfield 'group' will be initialized after field 'stepSize'r ims( t562i | d , n Tthirde(atdisdG)a,t hnetrh,r edaidrse(cntt-h>ruepa,d sN)U,L Lt,i daIrngBsl-o>cske(ntdhbruefafd,I daxr.gxs)-,> rgercovubpu(fgfr,o u p| ) ^, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:,562 :S15u:m ,warning: initializer order does not match the declaration order [-Wreorder-ctor]h alf) | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :t95i:d (note: texpanded from macro 'IMPL_COLL_FUNC'i d), nt h391r | e a dRsu(nnWtohrrke),, N C| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _ A| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)G O_##a l563g | o , N CsCtLe_pPSRiOzTeO(_n#c#cplrSohtmoe>m(.)c.ormumn.(b&unfcfcSliSzhemse[mN.CwCoLr_kP)R;O T\O _ S| I ^M PLE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/:N562C:C15L:_ Snote: Tfield 'nthreads' will be initialized after field 'tidInBlock'E PS/si z562e | o f ( T )t)i d{( t i| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) , | n group(groupt hreads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:e677a:d11s:) ,note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret idInBl o677c | k ( t h r e a d I d xp.rxi)m,s (gtriodu-pt(igdrSotuapr)t,B 
c a| s ^~~~~~~~~~~~~~~~~t , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:T562h:r60e:a dnote: sfield 'group' will be initialized after field 'stepSize'B cast ,562 | & d i r etcitd-(>toiudt),, dnitrherceta-d>sd(onwtnh,r eaardgss)-,> steinddIbnuBflfo,c ka(rtghsr-e>ardeIcdvxb.uxf)f,, g r| o ^u p(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:p202):,53 : | note: ^~~~~~~~~~~in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 
0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562(:g15r:o uwarning: pinitializer order does not match the declaration order [-Wreorder-ctor]) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)562 | t563i | d ( t i ds)t,e pnStihzree(andcsc(lnSthhmreema.dcso)m,m .tbiudfIfnSBilzoecsk[(NtChCrLe_aPdRIOdTxO._xS)I,M PgLrEo]u/pN(CgCrLo_uSpT)E,P S /| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i z e| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)f (T)) 563{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ s t| e group(groupp Size(ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hm:.626c:o9m:m .note: bin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 
1>, 0>::Primitives' requested hereu ffSize s626[ | N C C L _ P R O TpOr_iSmIsM(PtLiEd]-/tNiCdCSLt_aSrTtESPcSa/tstiezre,o fn(TTh)r)e a{d s S| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a tter, NULL, d| i group(groupr ect->up, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hg:s666-:>9s:e nnote: din instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herebuff, arg s666- | > r e c v b u f fp,r i m| s ^( tid, n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:h202r:e53a:d snote: Gin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea ther ,202 | d i r e c t - > uRpu,n WNoUrLkLE,l eamregnst-<>Fsne,n dTb,u fRfe,d Oapr,g sA-l>groe,c vPbruoftfo,> ( )| . ^r un(we); | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^: 202:53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 12:1: note: 202in instantiation of member function 'RunWork, 2, 2>::run' requested here | 12 | I MRPuLn_WCoOrLkLE_lFeUmNeCn(tAS(I)M.PrLuEn,( wSeu)m;, d| o ^u ble) | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp^: 11:1:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 391in instantiation of member function 'RunWork, 2, 2>::run' requested here: 95: note: 11expanded from macro 'IMPL_COLL_FUNC' | IMPL_C O391L | L _ FRUuNnCW(oArlkl), N| C^C L_AL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hG:O391_:#95#:a lnote: gexpanded from macro 'IMPL_COLL_FUNC'o , NCCL _391P | R O TROu_n#W#oprrkoc(l)F.urnucn#(#&fnucnccl,S htmyepme.,w oFrukn)c;# #\d e v| r ^e dop15,: Nnote: Cfield 
'nthreads' will be initialized after field 'tidInBlock'C L_ALG O562_ | # # a l gtoi,d (NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' 
requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | 
prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ irect->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member 
function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | pri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group ms(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hims(:t562i:d15-:t iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]S tartScatter, n562T | h r e a dtsiSdc(attitde)r,, nNtUhLrLe,a ddsi(rnetchtr-e>audps,) ,a rtgisd-I>nsBelnodcbku(ftfh,r eaardgIsd-x>.rxe)c,v bgurfofu,p ( g| r ^o up),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~53 : | note: tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202563 | | s t e pRSuinzWeo(rnkcEclleSmhemnetm<.Fcno,m mT.,b uRfefdSOipz,e sA[lNgCoC,L _PPrRoOtToO>_(S)I.MrPuLnE(]w/eN)C;C L _| S ^T EPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppi:z13e:o1f:( Tnote: )in instantiation of member function 'RunWork, 2, 2>::run' requested here) { | 13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | I M| P 
group(groupL _COLL_FUNC(Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:R641e:d11u:c enote: ,in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here COLLNE T641_ | D I R E C T , S I MpPrLiEm,s (Stuimd,- tricdcSlt_abrftlRoeadtu1c6e), n| T^h rea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s391R:e95d:u cnote: eexpanded from macro 'IMPL_COLL_FUNC', dire c391t | - > dRouwnnW,o r&kdnocu#t#,f uanrcg,s -t>yspeen,d bFuufnfc,# #adregvsr-e>droepc,, N| C ^C L_ALGO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:l202g:o53,: Nnote: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereC L_PR O202T | O _ # # p r o t oR>u(n)W.orruknE(l&enmcecnltS:( )note: .field 'nthreads' will be initialized after field 'tidInBlock'r un(w e562) | ; | ^t id(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp,: 11n:t1h:r enote: ain instantiation of member function 'RunWork, 2, 2>::run' requested hered s(nt h11r | eIaMdPsL)_,C OtLiLd_IFnUBNlCo(cAkl(ltRherdeuacdeI,d xC.OxL)L,N EgTr_oDuIpR(EgCrTo,u pS)I,M P L| E ^~~~~~~~~~~~~~~~~, S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:m562,: float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, ar60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ gs->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: 562note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here t i11d | (ItMiPdL)_,C OnLtLh_rFeUaNdCs((AnltlhRreedaudcse),, CtOiLdLINnEBTl_oDcIkR(EtChTr,e aSdIIMdPxL.Ex,) ,S ugmr,o ufpl(ogarto)u p )| ,^ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 391 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 95: note: expanded from macro 'IMPL_COLL_FUNC' 563 | s391t | e p SRiuzneW(onrckcC,C LN_CSCTLE_PASL/GsOi_z#e#oafl(gTo),) N{C C L| _ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P R O| T group(groupO _##proto>(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):.655r:u11n:( ¬e: nin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec clShmem .655w | o r k ) ; \ | ^p rims(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562-:t15i:d Snote: tfield 'nthreads' will be initialized after field 'tidInBlock'a rtRed u562c | e , n Tthirde(atdisdR)e,d uncteh,r enaudlsl(pnttrh,r e&addisr)e,c tt-i>doIuntB,l oacrkg(st-h>rseeanddIbduxf.fx,) ,a rggrso-u>pr(egcrvobuupf)f,, | | ^~~~~~~~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:: 202note: :field 'group' will be initialized after field 'stepSize'53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562 | 202 | t i d ( t i d ) ,R unntWhorrekaEdlse(mnetnhtrx(.)x.)r,u ng(rwoeu)p;( g r| o ^u p), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp ^~~~~~~~~~~: 12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.halgo, :N562C:C15L:_ Pwarning: Rinitializer order does not match the declaration order [-Wreorder-ctor]O TO_##proto>() .562r | u n ( & ntcicdl(Sthimde)m,. wnotrhkr)e;a d\s ( n| t ^h reads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :t15i:d Inote: nfield 'nthreads' will be initialized after field 'tidInBlock'B lock (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d s )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tidIn B563l | o c k ( tshtreepaSdiIzdex(.nxc)c,l Sghrmoeump.(cgormomu.pb)u,f f S| i ^~~~~~~~~~~~~~~~~z es[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L60_:P Rnote: Ofield 'group' will be initialized after field 'stepSize'T O_SI M562P | L E ] / NtCiCdL(_tSiTdE)P,S /nstihzreeoafd(sT()n)t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s ) ,| group(groupt idInBlock(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:I666d:x9.:x )note: ,in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here grou p666( | g r o u p ) , p| r ^~~~~~~~~~~i ms(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 
2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 
2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ redop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 |
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ clShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation 
of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(ncc:l562S:h15m:e mwarning: .initializer order does not match the declaration order [-Wreorder-ctor]c omm.buffSiz e562s | [ N C C Lt_iPdR(OtTiOd_)S,I MnPtLhEr]e/aNdCsC(Ln_tShTrEePaSd/ss)i,z etoifd(ITn)B)l o{c k (| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd Idx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:u641p:(11g:r onote: uin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep ), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 641 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | p r ismtse(ptSiidz-et(indcSctlaSrhtmReemd.uccoem,m .nbTuhfrfeSaidzseRse[dNuCcCeL,_ PdRiOrTeOc_tS-I>MdPoLwEn],/ N&CdCiLr_eScTtE-P>So/usti,z eaorfg(sT-)>)s e{n d b| u 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f f ,| group(groupa rgs->recvbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 666 ^: 9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202: 53666: | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here p202r | i m s ( t i d , RnuTnhWroerakdEslGeamtehnetr<,F nd,i rTe,c tR-e>duOpp,, NAUlLgLo,, aPrrgost-o>>s(e)n.drbuunf(fw,e )a;r g s| - ^> recvbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp,: 11 :| 1 ^: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 20211: | 53I:M Pnote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here_ COL L202_ | F U N C ( A l l RReudnuWcoer,k EClOeLmLeNnEtT<_FDnI,R ETC,T ,R eSdIOMpP,L EA,l gSou,m ,P rfoltooa>t()) . 
r| u^n (we);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391| : ^95 : note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp :11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here391 | R11u | nIWMoPrLk_E,, NSCuCmL,_ AfLlGoOa_t#)# a l| g^o , N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:C391L:_95P:R Onote: Texpanded from macro 'IMPL_COLL_FUNC'O _##p r391o | t o >R(u)n.Wrournk(<&nnccccllFSuhnmce#m#.fwuonrck,) ;t y\p e ,| ^F unc##d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:v562r:e15d:o pnote: , N562C | C L _ A LtGiOd_(#t#iadl)g,o ,n tNhCrCeLa_dPsR(OnTtOh_r#e#apdrso)t,o >t(i)d.IrnuBn(l/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ &ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:m562.:c15o:m mwarning: .initializer order 
does not match the declaration order [-Wreorder-ctor]b uffSizes[ N562C | C L _ P RtOiTdO(_tSiIdM)P,L En]t/hNrCeCaLd_sS(TnEtPhSr/esaidzse)o,f (tTi)d)I n{B l o| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k ( t| h group(groupr eadIdx.x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:o641u:p11(:g rnote: oin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu p), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~641 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | p r i m ss(tteipdS-itzied(SntcacrltSRhemducee,m .ncTohmrme.abdusfRfeSdiuzcees,[ NdCiCrLe_cPtR-O>TdOo_wSnI,M P&LdEi]r/eNcCtC-L>_oSuTtE,P Sa/rsgisz-e>osfe(nTd)b)u f{f , | a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r g s| - group(group> recvbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 687 ^: 11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: 687note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | p r i m s (RtuindW-otrikdESlteamretnBtct(-)>.oruutn,( wneu)l;l p t| r ^, args-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp>:s13e:n1d:b unote: fin instantiation of member function 'RunWork, 2, 2>::run' requested heref , ar g13s | -I>MrPeLc_vCbOuLfLf_,F U N| C ^( AllRed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:c202e:,53 :C Onote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereL NET_ D202I | R E C T , S I MRPuLnEW,o rSkuEml,e mrecnctl<_Fbnf,l oTa,t 
1R6e)d O p| ,^ Algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391P:r95o:t onote: >expanded from macro 'IMPL_COLL_FUNC'( ).run( w391e | ) ; R u| n ^W ork, 2, 2>::run' requested here# func ,12 | tIyMpPeL,_ CFOuLnLc_#F#UdNeCv(rAeldloRpe ,C ONLCLCNLE_TA_LDGIOR_E#C#Ta,l gSoI,M PNLCEC,L _SPuRmO,T Od_o#u#bplreo)t o >| (^) .ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:(391&:n95c:c lnote: Sexpanded from macro 'IMPL_COLL_FUNC'h mem.wo r391k | ) ; R\u n W| o ^r k(,n tNhCrCeLa_dAsL)G,O _t#i#daIlngBol,o cNkC(CtLh_rPeRaOdTIOd_x#.#xp)r,o tgor>o(u)p.(rgurno(u&pn)c,c l S| h ^~~~~~~~~~~~~~~~~m em/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:w562o:r60k:) ;note: field 'group' will be initialized after field 'stepSize'\ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562t:i15d:( tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd ), nt h562r | e a d s (tnitdh(rteiadd)s,) ,n tthirdeIandBsl(onctkh(rtehardesa)d,I dtxi.dxI)n,B lgorcoku(pt(hgrreoaudpI)d,x . 
x| ) ^~~~~~~~~~~, group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r15o:u pwarning: (initializer order does not match the declaration order [-Wreorder-ctor]g roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~562 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid( t563i | d ) , nsttherpeSaidzse((nntchcrleSahdmse)m,. 
ctoimdmI.nbBulfofcSki(ztehsr[eNaCdCILd_xP.RxO)T,O _gSrIoMuPpL(Eg]r/oNuCpC)L,_ S T| E ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~P S /| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i zeof( T563) | ) { s| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e p S| i group(groupz e(ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:o687m:m11.:b unote: fin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heref Sizes[ N687C | C L _ P R O T O _ S IpMrPiLmEs](/tNiCdC-Lt_iSdTSEtPaSr/tsBiczaesotf,( Tn)T)h r{e a d| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~B c a| s group(groupt , &direc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:-655>:o11u:t ,note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren ullpt r655, | a r g s - > s e n dpbruifmfs,( tairdg-st-i>drSetcavrbtuRfefd,u c e| , ^ nThreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:e202d:u53c:e ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren ullp t202r | , & d i r e c tR-u>noWuotr,k Ealregmse-n>tsArlegcov,b uPfrfo,t o >| ( ^) .run(we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h;: 202 :| 53 ^: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp: 13202: | 1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here R u13n | WIoMrPkLE_lCeOmLeLn_tFE(C)T.,r uSnI(MwPeL)E;, S| u ^m , 
rccl_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppb:f13l:o1a:t 1note: 6in instantiation of member function 'RunWork, 2, 2>::run' requested here) | ^ 13 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_391C:O95L:L _note: Fexpanded from macro 'IMPL_COLL_FUNC'U NC(All R391e | d u cReu,n WCoOrLkL^, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_391A:L95G:O _note: #expanded from macro 'IMPL_COLL_FUNC'# algo, N391C | C L _RPuRnOWToOr_k#<#npcrcoltFou>n(c)#.#rfuunn(c&,n ctcylpSeh,m eFmu.nwco#r#kd)e;v r\e d o| p ^< type>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562N:C15C:L _note: Afield 'nthreads' will be initialized after field 'tidInBlock'L GO_## a562l | g o , NtCiCdL(_tPiRdO)T,O _n#t#hprreoatdos>((n)t.hrruena(d&sn)c,c ltSihdmIenmB.lwoocrkk()t;h r\e a d| I ^d x.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :g562r:o15u:p (note: gfield 'nthreads' will be initialized after field 'tidInBlock'r oup), 562 | | ^~~~~~~~~~~~~~~~~ ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:(562t:i60d:) ,note: field 'group' will be initialized after field 'stepSize'n threa d562s | ( n t h rteiadd(st)i,d )t,i dnItnhBrleoacdks((tnhtrheraedaIddsx).,x )t,i dgIrnoBulpo(cgkr(otuhpr)e,a d I| d ^~~~~~~~~~~~~~~~~x .x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r60o:u pnote: (field 'group' will be initialized after field 'stepSize'g roup) ,562 | | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t15i:d (warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d), nthread s562( | n t h r etaidds()t,i dt)i,d InntBhlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utph)r,e a d| I ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d x .| x tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , gro u563p | ( g r o uspt)e,p S i| z ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e ( n| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)c lShmem .563c | o m m . 
bsutfefpSSiizzees([nNcCcClLS_hPmReOmT.Oc_oSmImM.PbLuEf]f/SNiCzCeLs_[SNTCECPLS_/PsRiOzTeOo_fS(ITM)P)L E{] / N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C L _| S group(groupT EPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:T655):)11 :{ note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 655 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hp:r641i:m11s:( tnote: iin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered -tidS t641a | r t R e d u c e , npTrhirmesa(dtsiRde-dtuicdeS,t anrutlRlepdturc,e ,& dniTrherceta-d>soRuetd,u caer,g sd-i>rseecntd-b>udfofw,n ,a r&gdsi-r>ercetc-v>bouuftf,, a r| g ^s ->sendbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:,202 :a53r:g snote: -in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here> rec v202b | u f f , | ^ RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:l202e:m53e:n tnote: , 2, 2>::run' requested hereF n, T ,202 | R e d O p , A lRguon,W oPrrkoEtloe>m(e)n.tr, 2, 2>::run' requested heret o>(). 
r13u | nI(MwPeL)_;C O L| L ^_ FUNC(A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cppl:l12R:e1d:u cnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested here, COL L12N | EITM_PDLI_RCEOCLTL,_ FSUINMCP(LAEl,l RSeudmu,c er,c cClO_LbLfNlEoTa_tD1I6R)E C T| ,^ SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:E391,: 95S:u mnote: ,expanded from macro 'IMPL_COLL_FUNC' double )391 | | ^R unWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:k391<:n95c:c lnote: Fexpanded from macro 'IMPL_COLL_FUNC'u nc##f u391n | c , RtuynpWeo,r kFp,e ,N CFCuLn_cA#L#GdOe_v#r#eadlogpo<,t yNpCeC>L,_ PNRCOCTLO__A#L#GpOr_o#t#oa>l(g)o.,r uNnC(C&Ln_cPcRlOSThOm_e#m#.pwroortko)>;( )\. r u| n ^( &ncclS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:m562e:m15.:w onote: rfield 'nthreads' will be initialized after field 'tidInBlock'k ); \ | 562 ^ | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t15i:d )note: ,field 'nthreads' will be initialized after field 'tidInBlock' nthr e562a | d s ( n tthirde(atdisd)),, tnitdhIrneBaldosc(kn(tthhrreeaaddsI)d,x .txi)d,I ngBrlooucpk((gtrhoruepa)d,I d x| . 
^~~~~~~~~~~~~~~~~x ), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p60(:g rnote: ofield 'group' will be initialized after field 'stepSize'u p), | ^~~~~~~~~~~~~~~~~562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t60i:d (note: tfield 'group' will be initialized after field 'stepSize'i d), 562n | t h r e atdisd((nttihdr)e,a dnst)h,r etaiddsI(nnBtlhorceka(dtsh)r,e atdiIddInBlocxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~g roup), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 641 | : 562 : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] prims(tid-ti d562S | t a r t Rteiddu(ctei,d )n,T hnrtehardesaRdesd(uncteh,r edaidrse)c,t -t>iddoIwnnB,l o&cdki(rtehcrte-a>doIudtx,. xa)r,g sg-r>osuepn(dgbruofufp,) ,a r g| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~- > r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)c vbu f563f | , | ^s tepSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:n202c:c53l:S hnote: min instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree m.co m202m | . 
b u f f S i z eRsu[nNWCoCrLk_EPlReOmTeOn_tS)()) .{r u n| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w e )| ; group(group | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::1641:: 11note: :in instantiation of member function 'RunWork, 2, 2>::run' requested here note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 12 | I M641P | L _ C O L L _ F U N Cp(rAilmlsR(etdiudc-et,i dCSOtLaLrNtERTe_dDuIcReE,C Tn,T hSrIeMaPdLsER,e dSuucme,, ddoiurbelcet)- > d| o^w n, &d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:r391e:c95t:- >note: oexpanded from macro 'IMPL_COLL_FUNC'u t, ar g391s | - > sReunndWbourfkf<,n cacrlgFsu-n>cr#e#cfvubnucf,f ,t y p| e ^, Func##d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:v202r:edop<53t:y pnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested here> , NC C202L | _ A L G O _ # # aRlugnoW,o rNkCEClLe_mPeRnOtTd(O)p.,r uAnl(g&on,c cPlrSohtmoe>m(.)w.orrukn)(;w e\) ; | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp::1513:: 1note: :field 'nthreads' will be initialized after field 'tidInBlock' note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 562 | 13 | I M PtLi_dC(OtLiLd_)F,U NnCt(hArlelaRdesd(uncteh,r eCaOdLsL)N,E Tt_iDdIIRnEBClTo,c kS(ItMhPrLeEa,d ISduxm.,x )r,c cglr_obufpl(ogarto1u6p)) , | ^| ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::391562::9560:: note: note: expanded from macro 'IMPL_COLL_FUNC'field 'group' will be initialized after field 'stepSize' 562391 | | R utniWdo(rtkid,x .NxC)C,L _gArLoGuOp_(#g#raolugpo),, N C| C ^~~~~~~~~~~L _PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, 
nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member
function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, 
NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx941. 67 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 67 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' 
requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, 
nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lgo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true 
-mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 
386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSiIn file included from z/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cppe:s1[: NIn file included from C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hC:L10_: PIn file included from 
R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hO:T167O: _/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:I562M:P15L:E ]warning: /initializer order does not match the declaration order [-Wreorder-ctor]N CCL_STEPS/ s562i | z e o f (tTi)d)( t{i d )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ n t| h group(groupr eads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:e916a:d7s:) ,note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested heret idIn B916l | o c k ( t h rperaidmIsd(xg.rxo)u,p Tgirdo,u pg(rgoruopuNpt)h,r e a| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s , | & tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r ecv, &563s | e n d , satregpsS-i>zsee(nndcbculfSfh,m eamr.gcso-m>mr.ebcuvfbfuSfifz,e s [| N ^C CL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:T202O:_53S:I Mnote: Pin instantiation of member function 'RunWorkElement, 3, 2>::run' requested hereL E]/N C202C | L _ S T E P S / sRiuzneWoofr(kTE)l)e m{e n t| < ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~F n ,| group(groupT , RedOp, Algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hP:r916o:t7o:> (note: )in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here. 
run(w e916) | ; | ^ prims/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp(:g6r:o1u:p Tnote: iin instantiation of member function 'RunWork, 3, 2>::run' requested hered , gr o6u | pINMtPhLr_eCaOdLsL,_ F&UrNeCc(vA,l l&Rseednudc,e ,a rCgOsL-L>NsEeTn_dCbHuAfIfN,, aSrIgMsP-L>Er,e cMvabxu,f fi,n t 3| 2 ^_ t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::53391:: 95note: :in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here note: expanded from macro 'IMPL_COLL_FUNC' 202 | 391 | R uRnuWnoWrokro(p)<.tryupne(>w,e )N;C C L| _ ^A LGO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cppa:l4g:o1,: Nnote: Cin instantiation of member function 'RunWork, 3, 2>::run' requested hereC L_PR O4T | OI_M#P#Lp_rCoOtLoL>_(F)U.NrCu(nA(l&lnRcecdluSchem,e mC.OwLoLrNkE)T;_ C\H A I| N ^, SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:L562E:,15 :M anote: xfield 'nthreads' will be initialized after field 'tidInBlock', int 8562_ | t ) | t^i d(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:)391,: 95n:t hnote: rexpanded from macro 'IMPL_COLL_FUNC'e ads(nt h391r | e a dRsu)n,W otrikd:,562 :N60C:C Lnote: _field 'group' will be initialized after field 'stepSize'A LGO_ #562# | a l g o ,t iNdC(CtLi_dP)R,O TnOt_h#r#epardost(on>t(h)r.eraudns()&,n ctcildSIhnmBelmo.cwko(rtkh)r;e a\d I d| x ^. 
x), group(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p15):, note: field 'nthreads' will be initialized after field 'tidInBlock'| ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 
1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &rec/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563v, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ _t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork(,t iNdC)C,L _nAtLhGrOe_a#d#sa(lngtoh,r eNaCdCsL)_,P RtOiTdOI_n#B#lporcokt(ot>h(r)e.arduInd(x&.nxc)c,l Sghrmoeump.(wgorroku)p;) ,\ | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ ^ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h563: | 562 : 15 : snote: tfield 'nthreads' will be initialized after field 'tidInBlock'e pSize( n562c | c l S h mteimd.(ctoimdm).,b unftfhSriezaedss[(NnCtChLr_ePaRdOsT)O,_ StIiMdPILnEB]l/oNcCkC(Lt_hSrTeEaPdSI/dsxi.zxe)o,f (gTr)o)u p{( g r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u p )| , group(group | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::60916:: 7note: :field 'group' will be initialized after field 'stepSize' note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 916 | t i d ( tpirdi)m,s (ngtrhoruepaTdisd(,n tghrroeuapdNst)h,r etaiddsI,n B&lroecckv(,t h&rseeanddI,d xa.rxg)s,- >gsreonudpb(ugfrfo,u pa)r,g s -| > ^~~~~~~~~~~r ecvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 |
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:5:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:8:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx906. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip 
--offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 
202 | RunWorkElement/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h<:F562n:,15 :T ,warning: initializer order does not match the declaration order [-Wreorder-ctor]R edOp, Alg o562, | P r o ttoi>d(()t.irdu)n,( wnet)h;r e a| d ^s (nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpps:)10,: 1t:i dnote: Iin instantiation of member function 'RunWork, 3, 2>::run' requested heren Bloc k10( | tIhMrPeLa_dCIOdLxL._xF)U,N Cg(rAolulpR(egdruocuep,) ,C O L| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~N E T| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C HAIN, 563S | I M P L Es,t ePprSoidz,e (hnaclcfl)S h m| e^m .comm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:b391u:f95f:S inote: zexpanded from macro 'IMPL_COLL_FUNC'e s[NCCL _391P | R O TROu_nSWIoMrPkL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 916N:C7C:L _note: Ain instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested hereL GO_# #916a | l g o , N CpCrLi_mPsR(OgTrOo_u#p#Tpirdo,t og>r(o)u.prNutnh(r&enacdcslS,h m&erme.cwvo,r k&)s;e n\d , | a ^r gs->se/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:d562b:u15f:f ,note: field 'nthreads' will be initialized after field 'tidInBlock'a rgs-> r562e | c v b u ftfi,d ( t| i ^d ), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a202d:s53(:n tnote: hin instantiation of member function 'RunWorkElement, 3, 2>::run' requested herer eads), tid I202n | B l o c k ( t h rReuandWIodrxk.Exl)e,m egnrtofield 'group' will be initialized after field 'stepSize'( ).run (562w | e ) ; t| i ^d (tid), 
n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cppt:h9r:e1a:d snote: (in instantiation of member function 'RunWork, 3, 2>::run' requested heren thre a9d | sI)M,P Lt_iCdOILnLB_lFoUcNkC((tAhlrleRaeddIudcxe.,x )C,O LgLrNoEuTp_(CgHrAoIuNp,) ,S I M| P ^~~~~~~~~~~L E, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562C:O15L:L _warning: Finitializer order does not match the declaration order [-Wreorder-ctor]U NC(AllRe d562u | c e , CtOiLdL(NtEiTd_)C,H AnItNh,r eSaIdMsP(LnEt,h rPeraodds,) ,h atlifd)I n B| l^o ck(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a391d:I95d:x .note: xexpanded from macro 'IMPL_COLL_FUNC') , grou p391( | g r oRuupn)W,o r k| < ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n c c| l tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)F unc# #563f | u n c , sttyeppeS,i zFeu(nncc#c#ldSehvmreemd.ocpof,f SNiCzCeLs_[ANLCGCOL__#P#RaOlTgOo_,S INMCPCLLE_]P/RNOCTCOL__#S#TpErPoSt/os>i(z)e.orfu(nT()&)n c{c l S| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m e m| . group(groupw ork); \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 916 ^: 7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 15916: | note: field 'nthreads' will be initialized after field 'tidInBlock' pri m562s | ( g r o utpiTdi(dt,i dg)r,o unptNhtrheraedasd(sn,t h&rreeacdvs,) ,& steinddI,n Balrogcsk-(>tshernedabduIfdfx,. xa)r,g sg-r>oruepc(vgbruofufp,) , | ^| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562::20260::53 :note: field 'group' will be initialized after field 'stepSize'note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 562202 | | t i d ( tRiudn)W,o rnktEhlreemaednst(a(d)I.drxu.nx()w,e )g;r o u| p ^( group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp,: 10 :| 1 ^~~~~~~~~~~: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- 
--offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, 
NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, 
NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group L_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:I562M:P15L:E ]warning: /initializer order does not match the declaration order [-Wreorder-ctor]N CCL_STEP S562/ | s i z e otfi(dT()t)i d{) , | n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa ds(nthreads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i641d:I11n:B lnote: oin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec k(thre a641d | I d x . 
x ) , g r opurpi(mgsr(otuipd)-,t i d| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t a r| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)R educ e563, | n T h rsetaedpsSRiezdeu(cnec,c ldSihrmeecmt.-c>odmomw.nb,u f&fdSiirzeecst[-N>CoCuLt_,P RaOrTgOs_-S>IsMePnLdEb]u/fNfC,C La_rSgTsE-P>Sr/escivzbeuofff(,T ) )| ^{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h group(group: 202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655: 11202: | note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here R u655n | W o r k E l e m e n tpT(h)r.eraudns(Rweed)u;c e ,| ^n ullptr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp,: 5&:d1i:r enote: cin instantiation of member function 'RunWork, 2, 2>::run' requested heret ->ou t5, | IaMrPgLs_-C>OsLeLn_dFbUuNfCf(,A lalrRgesd-u>cree,c vCbOuLfLfN,E T _| D ^I RECT, S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:M202P:L53E:, note: Pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer od, u202i | n t 8 _ t ) | R^u nWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:l391e:m95e:n tnote: , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:U562L:L15,: dwarning: iinitializer order does not match the declaration order [-Wreorder-ctor]r ect->up, 562a | r g s - >tsiedn(dtbiudf)f,, natrhgrse-a>drse(cnvtbhurfefa,d s )| , ^ tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:o202c:k53(:t hnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree adI d202x | . 
x ) , g r o uRpu(ngWroorukpE)l,e m e| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t < F| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), T, R e563d | O p , Asltgeop,S iPzreo(tnoc>c(l)S.hrmuenm(.wceo)m;m . b| u ^f fSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpps:[4N:C1C:L _note: Pin instantiation of member function 'RunWork, 2, 2>::run' requested hereR OTO _4S | IIMMPPLLE_]C/ONLCLC_LF_USNTCE(PASl/lsRiezdeuocfe(,T )C)O L{L N E| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ D I| R group(groupE CT, SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 655P:r11o:d ,note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei nt8_t )655 | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95p:r inote: mexpanded from macro 'IMPL_COLL_FUNC's (tid- t391i | d S tRaurntWRoerdkutoyupte,> ,a rNgCsC-L>_sAeLnGdOb_u#f#fa,l gaor,g sN-C>CrLe_cPvRbOuTfOf_,# # p| r ^o to>().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:n202(:&53n:c cnote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereS hmem .202w | o r k ) ; \ R| u ^n WorkE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:e562m:e15n:t r(e)a.drsu(nn(twher)e;a d s| ) ^, tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppl:o4c:k1(:t hnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree adId x4. 
| xI)M,P Lg_rCoOuLpL(_gFrUoNuCp()A,l l R| e ^~~~~~~~~~~~~~~~~d uc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:,562 :C60O:L Lnote: Nfield 'group' will be initialized after field 'stepSize'E T_DI R562E | C T , StIiMdP(LtEi,d )P,r ondt,h rienatd8s_(tn)t h r| e^a ds), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i391d:I95n:B lnote: oexpanded from macro 'IMPL_COLL_FUNC'c k(threa d391I | d x .Rxu)n,W ogrrko, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hfield 'nthreads' will be initialized after field 'tidInBlock': 562:15: 562warning: | initializer order does not match the declaration order [-Wreorder-ctor] tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~~~~~~~g rou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:)562,: 60 :| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~note: field 'group' will be initialized after field 'stepSize' | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i ds(tteipdS)i,z en(tnhcrcelaSdhsm(enmt.hcroemamd.sb)u,f ftSiidzIensB[lNoCcCkL(_tPhRrOe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.Tx), gOr_oSuIpM(PgLrEo]u/pN)C,C L _| S ^~~~~~~~~~~T EPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidIadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | 
prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInB RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, 
NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of 
member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce,
COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, 
nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: 
note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:: 562warning: :initializer order does not match the declaration order [-Wreorder-ctor]15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhrreeaaddss)(,n tthirdeIandBsl)o,c kt(itdhIrneBaldoIcdkx(.txh)r,e agdrIoduxp.(xg)r,o ugpr)o,u p (| g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o u| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~563 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) step S563i | z e ( n 
csctleSphSmiezme.(cnocmcml.SbhumfefmS.iczoemsm[.NbCuCfLf_SPiRzOeTsO[_NSCICMLP_LPER]O/TNOC_CSLI_MSPTLEEP]S//NsCiCzLe_oSfT(ETP)S)/ s{i z e| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f ( T| ) group(group) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h| : group(group655 :11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 641 : 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here prims(t i641d | - t i d S t a r t R epdruicmes,( tniTdh-rteiaddSstRaerdtuRceed,u cneu,l lnpTthrr,e a&ddsiRreedcutc-e>,o udti,r eacrtg-s>down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562562: | 15 : warning: initializer 
order does not match the declaration order [-Wreorder-ctor]t id(tid), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~~~~~~~g rou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r60o:u pnote: )field 'group' will be initialized after field 'stepSize', | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 562 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) ti d563( | t i d ) ,s tnetphSriezaed(sn(cnctlhSrhemaedms.)c,o mtmi.dbIunfBflSoiczke(st[hNrCeCaLd_IPdRxO.TxO)_,S IgMrPoLuEp](/gNrCoCuLp_)S,T E P| S ^~~~~~~~~~~/ sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ -/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>:s562e:n15d:b uwarning: finitializer order does not match the declaration order [-Wreorder-ctor]f , args->rec v562b | u f f , t i| d ^( tid), nthreads(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:)202,: 53t:i dnote: Iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren Block(th r202e | a d I d x . x ) ,R ugnrWoourpk(Eglreomuepn)t,< F n| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ T ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)R edOp, 563A | l g o , sPtreoptSoi>z(e)(.nrcucnl(Swhem)e;m . 
c| o ^m m.buff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppS:i6z:e1s:[ Nnote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested hereC L_P R6O | TIOM_PSLI_MCPOLLEL]_/FNUCNCCL(_ASlTlERPeSd/usciez,e oCfO(LTL)N)E T{_ D I| R ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E C T| , group(group SIMPLE, Prod/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 666i:n9t:3 2note: _in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret ) | ^ 666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 : note: pexpanded from macro 'IMPL_COLL_FUNC'r ims(t i391d | , nRTuhnrWeoardks utpy,p eN,U LFLu,n ca#r#gdse-v>rseednodpba,r gNsC-C>Lr_eAcLvGbOu_f#f#,a l g| o ^, NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:O202T:O53_:# #note: pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer oto>( )202. | r u n ( &n c c l SRhumneWmo.rwkoErlke)m;e n\t < F| n ^, T, Red/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:p562,: 15A:l gnote: ofield 'nthreads' will be initialized after field 'tidInBlock', Prot o562> | ( ) . 
r utni(dw(et)i;d ) ,| ^n threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppd:s5(:n1t:h rnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herea ds), 5t | iIdMIPnLB_lCoOcLkL(_tFhUrNeCa(dAIldlxR.exd)u,c eg,r oCuOpL(LgNrEoTu_pD)I,R E C| T ^~~~~~~~~~~~~~~~~, S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:M562P:L60E:, note: Pfield 'group' will be initialized after field 'stepSize'r od, u562i | n t 8 _ tt)i d (| t^i d), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t391h:r95e:a dnote: sexpanded from macro 'IMPL_COLL_FUNC'( nthread s391) | , tRiudnIWnoBrlko, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562I:d15x:. 
xwarning: )initializer order does not match the declaration order [-Wreorder-ctor], group(gro u562p | ) , | t ^~~~~~~~~~~i d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : group(group15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 677 | t i d ( t i d ) , pnrtihmrse(atdisd(-nttihdrSetaadrst)B,c atsitd,I nnBTlhorceka(dtshBrceaasdtI,d x&.dxi)r,e cgtr-o>uopu(tg,r oduipr)e,c t -| > ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d o w| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), ar g563s | - > s e nsdtbeupfSfi,z ea(rngcsc-l>Srhemcevmb.ucfofm,m . 
b| u ^f fSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h[:N202C:C53L:_ Pnote: Rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereO TO_S I202M | P L E ] / N C C LR_uSnTWEoPrSk/Esliezmeeonft(()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:u666n:(9w:e )note: ;in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp : 5 : 1 : pnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested herei ms(t i5d | ,I MnPTLh_rCeOaLdLs_GFaUtNhCe(rA,l ldRierdeuccte-,> uCpO,L LNNUELTL_,D IaRrEgCsT-,> sSeInMdPbLuEf,f ,P raordg,s -u>irnetc8v_btu)f f ,| | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562202::1553:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 562 | R u n Wtoirdk(Etliedm)e,n tno(c)k.(rtuhnr(ewaed)I;d x .| x ^) , group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppg:r6o:u1p:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 6| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I MPL_C O563L | L _ F U NsCt(eAplSliRzeed(unccec,l SChOmLeLmN.EcTo_mDmI.RbEuCfTf,S iSzIeMsP[LNEC,C LP_rPoRdO,T Oi_nStI3M2P_LtE)] / N| C^C L_STE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:S391/:s95i:z enote: oexpanded from macro 'IMPL_COLL_FUNC'f (T)) { 391 
| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ R u| n group(groupW ork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep e, Fu n641c | # # d e v r e d o p (,t iNdC-CtLi_dASLtGaOr_t#R#eadlugcoe,, NnCTChLr_ePaRdOsTROe_d#u#cper,o tdoi>r(e)c.tr-u>nd(o&wnnc,c l&Sdhimreem.work); \ c| t ^- >out, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:r562g:s15-:> note: field 'nthreads' will be initialized after field 'tidInBlock' send b562u | f f , atrigds(-t>irde)c,v bnutfhfr,e a d| s ^( nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202t:i53d:I nnote: Bin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herel ock( t202h | r e a d I d x . xR)u,n WgorrokuEpl(egmreonutp<)F,n , | T ^~~~~~~~~~~~~~~~~, Re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:O562p:,60 :A lnote: gfield 'group' will be initialized after field 'stepSize'o , P r562o | t o > ( )t.irdu(nt(iwde)),; n t| h ^r eads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppr:e6a:d1s:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested heret idInB l6o | cIkM(PtLh_rCeOaLdLI_dFxU.NxC)(,A lglrRoeudpu(cger,o uCpO)L,L N E| T ^~~~~~~~~~~_ DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tid^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 
'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d s(nthr e562a | d s ) , 
ttiidd(ItniBdl)o,c kn(tthhrreeaaddsI(dnxt.hxr)e,a dgsr)o,u pt(igdrIonuBpl)o,c k (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d Idx.x) ,563 | g r o u ps(tgerpoSuipz)e,( n c| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~l S h| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e m.co m563m | . b u f fsStiezpeSsi[zNeC(CnLc_cPlRSOhTmOe_mS.IcMoPmLmE.]b/uNfCfCSLi_zSeTsE[PNSC/CsLi_zPeRoOfT(OT_)S)I M{P L E| ] ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/ N C| C group(groupL _STEPS/size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:f655(:T11):) note: {in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 655 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 655 : 11 : pnote: rin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei ms(t i655d | - t i d S t a r t R epdruicmes,( tniTdh-rteiaddSstRaerdtuRceed,u cneu,l lnpTthrr,e a&ddsiRreedcutc-e>,o untu,l laprtgrs,- >&sdeinrdebcutf-f>,o uatr,g sa-r>grse-c>vsbeunfdfb,u f f| , ^ args->recv/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hb:u202f:f53,: note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here ^ 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 202 : 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here Run W202o | r k E l e m e n tR,( )A.lrguon,( wPer)o;t o >| ( ^) 
.run(we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp;: 4 :| 1 ^: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6 :41 | :I Mnote: Pin instantiation of member function 'RunWork, 2, 2>::run' requested hereL _CO L6L | _IFMUPNLC_(CAOlLlLR_eFdUuNcCe(,A lClORLeLdNuEcTe_,D ICROELCLTN,E TS_IDMIPRLEEC,T ,P rSoIdM,P LiEn,t 8P_rto)d , | i^n t32_t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h): 391 :| 95^: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | R u391n | W o rRku<,t yNpCeC>L,_ ANLCGCOL__#A#LaGlOg_o#,# aNlCgCoL,_ PNRCOCTLO__P#R#OpTrOo_t#o#>p(r)o.trou>n(()&.nrcucnl(S&hnmcecml.Swhomrekm).;w o\r k )| ; ^ \ | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :field 'nthreads' will be initialized after field 'tidInBlock'15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i d ( ttiidd)(,t indt)h,r enatdhsr(enatdhsr(enatdhsr)e,a dtsi)d,I ntBildoIcnkB(ltohcrke(atdhIrdexa.dxI)d,x .gxr)o,u pg(rgoruopu(pg)r,o u p| ) ^~~~~~~~~~~~~~~~~, | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562::56260::60 :note: field 'group' will be initialized after field 'stepSize'note: field 'group' will be initialized after field 'stepSize' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, 
ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, 
direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562N:U15L:L ,warning: initializer order does not match the declaration order [-Wreorder-ctor]a rgs->se n562d | b u f f ,t iadr(gtsi-d>)r,e cnvtbhurfefa,d s (| n ^t hreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202t:i53d:I nnote: Bin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herel ock( t202h | r e a d I d x . 
xR)u,n WgorrokuEpl(egmreonutp<)F,n , | T ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, R| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d Op, A l563g | o , P rsotteop>S(i)z.er(unnc(cwleS)h;m e m| . ^c omm.buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppf:S5i:z1e:s [note: Nin instantiation of member function 'RunWork, 2, 2>::run' requested hereC CL_P R5O | TIOM_PSLI_MCPOLLEL]_/FNUCNCCL(_ASlTlERPeSd/usciez,e oCfO(LTL)N)E T{_ D I| R ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E C T| , group(group SIMPLE, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hP:r687o:d11,: unote: iin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren t8_t) 687 | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95p:r inote: mexpanded from macro 'IMPL_COLL_FUNC's (tid-t i391d | S t aRrutnBWcoarskt,< nncTchlrFeuandcs#B#cfausntc,, &tdyipree,c tF-u>nocu#t#,d envurleldpotpr<,t yapreg>s,- >NsCeCnLd_bAuLfGfO,_ #a#raglsg-o>,r eNcCvCbLu_fPfR,O T O| _ ^# #proto>()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:r202u:n53(:& nnote: cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herec lShme m202. 
| w o r k ) ; \ R u| n ^W orkEl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:m562e:n15t:< Fnote: nfield 'nthreads' will be initialized after field 'tidInBlock', T, R e562d | O p , Atligdo(,t iPdr)o,t on>t(h)r.eraudns((wnet)h;r e a| d ^s ), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppB:l5o:c1k:( tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested herer eadI d5x | .IxM)P,L _gCrOoLuLp_(FgUrNoCu(pA)l,l R e| d ^~~~~~~~~~~~~~~~~u ce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :C562O:L60L:N Enote: Tfield 'group' will be initialized after field 'stepSize'_ DIREC T562, | S I M PtLiEd,( tPirdo)d,, nutihnrte8a_dts)( n t| h^r eads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391t:i95d:I nnote: Bexpanded from macro 'IMPL_COLL_FUNC'l ock(thr e391a | d I dRxu.nxW)o,r kg, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :N562C:C15L:_ Pwarning: Rinitializer order does not match the declaration order [-Wreorder-ctor]O TO_##pro t562o | > ( ) . rtuind((&tnicdc)l,S hnmtehmr.ewaodrsk()n;t h\r e a| d ^s ), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562n:B15l:o cnote: kfield 'nthreads' will be initialized after field 'tidInBlock'( thread I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eads )563, | t i d IsntBelpoScikz(et(hnrcecaldSIhdmxe.m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hoto>:(562):.15r:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu nwarning: (:initializer order does not match the declaration order [-Wreorder-ctor]&562 n:c15c:l Swarning: h initializer order does not match the declaration order [-Wreorder-ctor]m562 e | m . 
w o rtki)d;( t562\i | d ) | , ^ nttih/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hdr:(e562ta:id15ds:)( ,nnote: tfield 'nthreads' will be initialized after field 'tidInBlock'nh trher ae562da | sd )s ,( n tttihidrdIe(natBdilsdo))c,,k (tntithdhrIreneaBadldIosdc(xkn.(txth)hr,re eagadrdsoI)ud,px (.tgxir)do,Iu npgB)rl,oo uc pk| (( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~gt rh or| ue tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)pa )d,I d 563x| | . ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ x ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)sg treopu Sp563i( | zg er (o nu cpsc)tl,eS ph Sm| ie ^~~~~~~~~~~~~~~~~zm e.(c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hno:cm562cm:l.60Sb:hu mfnote: effield 'group' will be initialized after field 'stepSize'mS .icz oe562ms | m[ .N bC uC fLtf_iSPdiR(zOteTisOd[_)NS,CI CMnLPt_LhPErR]eO/aTNdOCs_C(SLnI_tMShPTrLEeEPa]Sd//sNs)Ci,Cz Let_oiSfdT(IETnP)BS)l/ os{ci kz (e| to ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~hf r( eT| a) group(groupd) I d{x . 
x| /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~:641, : 11g| :r group(group o note: uin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep (group )641, | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : | 666 ^~~~~~~~~~~: 9 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here pri m666s | ( t i d - t i d SptrairmtsR(etdiudc,e ,n TnhTrheraedasdGsaRtehdeurc,e ,d idriercetc-t>-u>pd,o wNnU,L L&,d iarregcst-->>soeuntd,b uafrfg,s -a>rsgesn-d>bruefcfv,b uafrfg,s - >| r ^e cvbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:,202 : 53| : ^ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereR unWor k202E | l e m e n t < F nR,u nTW,o rRkeEdlOepm,e nAtld(O)p.,r uAnl(gwoe,) ;P r o| t ^o >().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppu:n6(:w1e:) ;note: in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^ 6 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp_:C7O:L1L:_ Fnote: Uin instantiation of member function 'RunWork, 2, 2>::run' requested hereN C(Al l7R | eIdMuPcLe_,C OCLOLL_LFNUENTC_(DAIlRlERCeTd,u cSeI,M PCLOEL,L NPErTo_dD,I RiEnCtT3,2 _StI)M P L| E^, Prod/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391u:i95n:t 3note: 2expanded from macro 'IMPL_COLL_FUNC'_ t) | ^391 | 
Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:W391o:r95k:< nnote: cexpanded from macro 'IMPL_COLL_FUNC'c lFunc# #391f | u n cR,u ntWyoprek,< nFcucnlcF#u#ndce#v#rfeudnocp,< ttyyppee>,, FNuCnCcL#_#AdLeGvOr_e#d#oaplC,C LN_CPCRLO_TAOL_G#O#_p#r#oatlog>o(,) .NrCuCnL(_&PnRcOcTlOS_h#m#epmr.owtoor>k());. r\u n (| & ^n cclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:.562w:o15r:k )note: ;field 'nthreads' will be initialized after field 'tidInBlock' \ | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562t:i15d:( tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd ), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~~~~~~~ grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r60o:u pnote: )field 'group' will be initialized after field 'stepSize', | ^~~~~~~~~~~~~~~~~ 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 60 :t inote: dfield 'group' will be initialized after field 'stepSize'( tid) ,562 | n t h r etaidds((tnitdh)r,e andtsh)r, etaiddsI(nnBtlhorceka(dtsh)r,e atdiIddIxn.Bxl)o,c kg(rtohurpe(agdrIoduxp.)x,) , | g ^~~~~~~~~~~r oup(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop<type>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop<type>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::687562::1115:: note: warning: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereinitializer order does not match the declaration order [-Wreorder-ctor] 687 | 562 | ptriidm(st(itdi)d,- tnitdhSrteaardtsB(cnatshtr,e andTsh)r,e atdisdBIcnaBslto,c k&(dtihrreecatd-I>doxu.tx,) ,n uglrloputpr(,g rargs->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :field 'nthreads' will be initialized after field 'tidInBlock'562 :15: warning: 562initializer order does not match the declaration order [-Wreorder-ctor] | tid(t i562d | ) , n tthirde(atdisd()n,t hnrtehardesa)d,s (tnitdhIrneBaldosc)k,( tthirdeIandBIldoxc.kx()t,h rgeraoduIpd(xg.rxo)u,p )g,r o u| p ^~~~~~~~~~~~~~~~~( gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562):,60 : | note: ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~field 'group' will be initialized after field 'stepSize' | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i ds(tteipdS)i,z en(tnhcrcelaSdhsm(enmt.hcroemamd.sb)u,f ftSiidzIensB[lNoCcCk(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562562: | 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor]t id(tid) ,562 | n t h r etaidds((tnitdh)r,e andtsh)r,e atdisd(InntBhlroecakd(st)h,r etaiddIIdnxB.lxo)c,k (gtrhoruepa(dgIrdoxu.px)),, g| r ^~~~~~~~~~~~~~~~~o up(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u60p:) ,note: field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 562 | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h 202: | 562 : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] RunWorkE l562e | m e n t s()),. 
rtuind(IwneB)l;o c k| ( ^t hreadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppx:.5x:)1,: gnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested hereo up(g r5o | uIpM)P,L _ C| O ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L L _| F tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)U NC(A l563l | R e d u cset,e pCSOiLzLeN(EnTc_cDlISRhEmCeTm,. cSoImMmP.LbEu,f fPSriozde,s [uNiCnCtL8__PtR)O T O| _^S IMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h]:/391N:C95C:L _note: Sexpanded from macro 'IMPL_COLL_FUNC'T EPS/si z391e | o f (RTu)n)W o{r k <| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c c l| F group(groupu nc##func, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:y666p:e9,: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement562( | ) . r u nt(iwde()t;i d )| , ^ nth reads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppn:t7h:r1e:a dnote: sin instantiation of member function 'RunWork, 2, 2>::run' requested here) , ti d7I | nIBMlPoLc_kC(OtLhLr_eFaUdNICd(xA.lxl)R,e dgurcoeu,p (CgOrLoLuNpE)T,_ D I| R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~E C T| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) SIMPL E563, | P r o ds,t eupiSnitz3e2(_ntc)c l S| h^m em.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:m391m:.95b:u fnote: fexpanded from macro 'IMPL_COLL_FUNC'S izes[N C391C | L _ PRRuOnTWOo_rSkI, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hC:L677_:A11L:G Onote: _in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here# #algo, 677N | C C L _ P R 
O T O _ #p#rpirmost(ot>i(d)-.triudnS(t&anrctcBlcSahsmte,m .nwTohrrke)a;d s\B c a| s ^t , &d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:r562e:c15t:- >note: ofield 'nthreads' will be initialized after field 'tidInBlock'u t, di r562e | c t - > dtoiwdn(,t iadr)g,s -n>tshernedabdusf(fn,t harregasd-s>)r,e ctvibduIfnfB,l o c| k ^( threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:I202d:x53.:x )note: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here grou p202( | g r o u p ) , R| u ^~~~~~~~~~~~~~~~~n Wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562E:l60e:m enote: nfield 'group' will be initialized after field 'stepSize't d(s)(.nrtuhnr(ewaed)s;) , | t ^i dInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppo:c6k:(1t:h rnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herea dIdx .6x | )I,M PgLr_oCuOpL(Lg_rFoUuNpC)(,A l l| R ^~~~~~~~~~~e duce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:G562O:_15#:# awarning: linitializer order does not match the declaration order [-Wreorder-ctor]g o, NCCL_PR O562T | O _ # # ptriodt(ot>i(d)).,r unnt(h&rnecacdlsS(hnmtehmr.ewaodrsk)),; t\i d I| n ^B 
lock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r15e:a dnote: Ifield 'nthreads' will be initialized after field 'tidInBlock'd x.x) ,562 | g r o u pt(igdr(otuipd)),, n| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~h r e| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d s(nt h563r | e a d s )s,t etpiSdiIzneB(lnoccckl(Sthhmreema.dcIodmxm..xb)u,f fgSriozueps([gNrCoCuLp_)P,R O T| O ^~~~~~~~~~~~~~~~~_ SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:L562E:]60/:N Cnote: Cfield 'group' will be initialized after field 'stepSize'L _STE P562S | / s i z etoifd((Tt)i)d ){, n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:)687,: 11t:i dnote: Iin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren Bloc k687( | t h r e a d I d x . 
xp)r,i mgsr(otuipd(-gtrioduSpt)a,r t B| c ^~~~~~~~~~~a st, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, 
nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 
'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:t562e:p15S:i zwarning: einitializer order does not match the declaration order [-Wreorder-ctor]( 
ncclShme m562. | c o m m .tbiudf(ftSiidz)e,s [nNtChCrLe_aPdRsO(TnOt_hSrIeMaPdLsE)],/ NtCiCdLI_nSBTlEoPcSk/(stihzreeoafd(ITd)x). x{) , | g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group( group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 687 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 11 :| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 563 | 687 | s t e p S i z e (pnrcicmlsS(htmiedm-.tciodmSmt.abrutfBfcSaiszte,s [nNTChCrLe_aPdRsOBTcOa_sStI,M P&LdEi]r/eNcCtC-L>_oSuTtE,P Sn/uslilzpetorf,( Ta)r)g s{- > s| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n d b| u group(groupf f, args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:e666c:v9b:u fnote: fin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, | ^ 666 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 :p53r:i mnote: sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here( tid, 202n | T h r e a d s G aRtuhneWro,r kdEilreemcetn-t>lsgeon,d bPurfoft,o >a(r)g.sr-u>nr(ewcev)b;u f f| , ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h1::202 :note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s15):, warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i dInBlock (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~d s),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d60I:n Bnote: lfield 'group' will be initialized after field 'stepSize'o 
ck(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , tid I563n | B l o c ks(ttehprSeiazdeI(dnxc.cxl)S,h mgermo.ucpo(mgmr.obuupf)f,S i z| e ^~~~~~~~~~~s [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:T562):)15 :{ warning: initializer order does not match the declaration order [-Wreorder-ctor]| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :n626t:h9r:e anote: din instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres (nthr e626a | d s ) , t i d IpnrBilmosc(kt(itdh-rteiaddSItdaxr.txS)c,a tgtreoru,p (ngTrhoruepa)d,s S c| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t t e| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), NULL ,563 | d i r e cstt-e>puSpi,z ea(rngcsc-l>Sshemnedmb.ucfofm,m .abrugfsf-S>irzeecsv[bNuCfCfL,_ P R| O ^T O_SIMPLE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/:N202C:C53L:_ Snote: Tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereE PS/s i202z | e o f ( T ) ) {R u n| W ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o r k| E group(groupl ement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereO p, Alg o666, | P r o t o > ( )p.rriumns((wtei)d;, n| T ^h readsGa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppt:h8e:r1,: dnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested herer ect- >8u | pI,M PNLU_LCLO,L La_rFgUsN-C>(sAelnldRbeudfufc,e ,a rCgOsL-L>NrEeTc_vDbIuRfEfC,T , | S ^I MPLE, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:r202o:d53,: inote: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret 64_t )202 | | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :R391u:n95W:o rnote: kexpanded from macro 'IMPL_COLL_FUNC'E lement< F391n | , TR,u nRWeodrOkp<,n cAcllgFou,n cP#r#oftuon>c(,) .tryupne(,w eF)u;n c #| # ^d evredo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppp:<7t:y1p:e >note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here NCCL_ A7L | GIOM_P#L#_aClOgLoL,_ FNUCNCCL(_APlRlORTeOd_u#c#ep,r oCtOoL>L(N)E.Tr_uDnI(R&EnCcTc,l SShImMePmL.Ew,o rPkr)o;d ,\ u i| n ^t 32_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562^: 15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :field 'nthreads' will be initialized after field 'tidInBlock'391 :95: note: expanded from macro 'IMPL_COLL_FUNC' 562 | t391i | d ( tRiudn)W,o rnkt ,g rNoCuCpL(_gArLoGuOp_)#,# a l| g ^~~~~~~~~~~~~~~~~o , NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:P60R:O Tnote: Ofield 'group' will be initialized after field 'stepSize'_ ##prot o562> | ( ) . r utni(d&(ntcicdl)S,h mnetmh.rweoardks)(;n t\h r e| a ^d s), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:I15n:B lnote: ofield 'nthreads' will be initialized after field 'tidInBlock'c k(thr e562a | d I d x .txi)d,( tgirdo)u,p (ngtrhoruepa)d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,s: (562 n:| t15 ^~~~~~~~~~~h: r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d s), tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~i dI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:B562l:o60c:k (note: tfield 'group' will be initialized after field 'stepSize'h read I562d | x . 
x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eads) ,563 | t i d I nsBtleopcSki(zteh(rnecacdlISdhxm.exm).,c ogmrmo.ubpu(fgfrSoiuzpe)s,[ N C| C ^~~~~~~~~~~L _PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:I dwarning: xinitializer order does not match the declaration order [-Wreorder-ctor]. x), group (562g | r o u p )t,i d (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) nthr e563a | d s ( n tshtreepaSdisz)e,( ntcicdlISnhBmleomc.kc(otmhmr.ebaudfIfdSxi.zxe)s,[ NgCrCoLu_pP(RgOrToOu_pS)I,M P L| E ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~] / N| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C L_ST E563P | S / s i zsetoefp(STi)z)e ({n c c| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S h m| e group(groupm .comm.buffSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h[:N626C:C9L:_ Pnote: Rin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereO TO_SI M626P | L E ] / N C C L _pSrTiEmPsS(/tsiidz-etoifd(STt)a)r t{S c a| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t 
e r| , group(group nThreadsSc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:t655t:e11r:, note: Nin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereU LL, d i655r | e c t - > u p , a rpgrsi-m>ss(etniddb-utfifd,S taarrgtReduces,- >nrTehcrvebaudfsfR,e d u| c ^e , null/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:t202r:,53 :& dnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer ect -202> | o u t , a r g sR-u>nsWeonrdkbEulfefm,e natr rTe,c vRbeudfOfp,, A| l ^g o, Pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:o202>:(53):. rnote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren (we) ;202 | | ^ R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppu:n8W:o1r:k Enote: lin instantiation of member function 'RunWork, 2, 2>::run' requested heree ment <8F | nI,M PTL,_ CROeLdLO_pF,U NACl(gAol,l RPerdoutcoe>,( )C.OrLuLnN(EwTe_)D;I R E| C ^T , SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppP:L8E:,1 :P rnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested hered , i n8t | 6I4M_PtL)_ C O| L^L _FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:A391l:l95R:e dnote: uexpanded from macro 'IMPL_COLL_FUNC'c e, CO L391L | N E TR_uDnIWRoErCkT<,n cScIlMFPuLnEc,# #Pfruondc,, itnytp6e4,_ tF)u n c| #^# dev/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e391d:o95p:< tnote: yexpanded from macro 'IMPL_COLL_FUNC'p e>, N C391C | L _ ARLuGnOW_o#r#ka().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | :I562M:P15L:_ Cwarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]L L_FUNC(AllR e562d | u c e , tCiOdL(LtNiEdT)_,D InRtEhCrTe,a dSsI(MnPtLhEr,e aPdrso)d,, tuiidnItn3B2l_otc)k ( t| h^r eadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hx:.391x:)95,: gnote: rexpanded from macro 'IMPL_COLL_FUNC'o up(gr o391u | p ) ,R u n| W ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o r k| < tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n cclF u563n | c##fu n c , sttyeppeS,i zFeu(nncc#c#ldSehvmreemd.ocpof,f SNiCzCeLs_[ANLCGCOL__#P#RaOlTgOo_,S INMCPCLLE_]P/RNOCTCOL__#S#TpErPoSt/os>i(z)e.orfu(nT()&)n c{c l S| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m e m| . 
group(groupw ork); \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^: 687:11: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 562:15: note: 687field 'nthreads' will be initialized after field 'tidInBlock' | 562 | ptriidm(st(itdi)d,- tnitdhSrteaardtsB(cnatshtr,e andTsh)r,e atdisdBIcnaBslto,c k&(dtihrreecatd-I>doxu.tx,) ,n uglrloputpr(,g raorugps)-,> s e| n ^~~~~~~~~~~~~~~~~d buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:,562 :a60r:g snote: -field 'group' will be initialized after field 'stepSize'> recv b562u | f f , t| i ^d (tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :n202t:h53r:e anote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres (nt h202r | e a d s ) , t iRduInnWBolrokcEkl(etmherneta ().run(we); | ^ cl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppF:u8n:c1#:# fnote: uin instantiation of member function 'RunWork, 2, 2>::run' requested heren c, t8y | pIeM,P LF_uCnOcL#L#_dFeUvNrCe(dAolple,, NCCOCLLL_NAELTG_OD_I#R#EaClTg,o ,S INMCPCLLE_,P RPOrToOd_,# #ipnrto6t4o_>t()) . 
r| u^n (&nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:l391S:h95m:e mnote: .expanded from macro 'IMPL_COLL_FUNC'w ork); 391\ | | R ^u nWorke,a dNsC(CnLt_hArLeGaOd_s#)#,a ltgiod,I nNBClCoLc_kP(RtOhTrOe_a#d#Ipdrxo.txo)>,( )g.rrouunp((&gnrcoculpS)h,m em.work )| ; ^~~~~~~~~~~~~~~~~ \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^: 562:60: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hfield 'group' will be initialized after field 'stepSize': 562:15: note: 562field 'nthreads' will be initialized after field 'tidInBlock' | t i562d | ( t i d )t,i dn(tthirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~u p), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, 
&direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hck:562(:t15h:r ewarning: ainitializer order 
does not match the declaration order [-Wreorder-ctor]d Idx.x), 562g | r o u p (tgirdo(utpi)d,) , | n ^~~~~~~~~~~t hreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.heads):,562 :t15i:d Iwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]B lock(threadIdx.x )562, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r e a| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s (nth r563e | a d s ) ,s tteipdSIinzBel(oncckc(ltShhrmeeamd.Icdoxm.mx.)b,u fgfrSoiuzpe(sg[rNoCuCpL)_,P R O| T ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~O _ S| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)M PLE]/N C563C | L _ S T EsPtSe/psSiizzeeo(fn(cTc)l)S h{m e m| . ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c o m| m group(group. buffSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h[:N655C:C11L:_ Pnote: Rin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereO TO_SI M655P | L E ] / N C C L _ S TpErPiSm/ss(itziedo-ft(iTd)S)t a{r t R| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d u c| e group(group, nThreadsRedu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:e626,: 9n:u lnote: lin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep tr, & d626i | r e c t - > o u tp,r iamrsg(st-i>ds-etniddbSutfafr,t Sacragtst-e>rr,e cnvTbhurfef, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:a202d:s53S:c anote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret er, N202U | L L , d i r e cRtu-n>Wuopr,k Ealregmse-n>tsArlegcov,b uPfrfo,t o >| ( ^) .run(we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h;: 202 :| 53 ^: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp :2028 | : 1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here R u8n | WIoMrPkLE_lCeOmLeLn_tFE(C)T.,r uSnI(MwPeL)E;, P| r ^o d, int6/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp4:_8t:)1 : | note: ^in instantiation of member 
function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :8391 | :I95M:P Lnote: _expanded from macro 'IMPL_COLL_FUNC'C OLL_F U391N | C ( ARlulnRWeodrukc ,| ^N CCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hA:L391G:O95_:# #note: aexpanded from macro 'IMPL_COLL_FUNC'l go, N C391C | L _ PRRuOnTWOo_r#k#u(n)c.#r#ufnu(n&cn,c ctlySphem,e mF.uwnocr#k#)d;e v\r e d| o ^p :,562 :N15C:C Lnote: _field 'nthreads' will be initialized after field 'tidInBlock'A LGO_ #562# | a l g o ,t iNdC(CtLi_dP)R,O TnOt_h#r#epardost(on>t(h)r.eraudns()&,n ctcildSIhnmBelmo.cwko(rtkh)r;e a\d I d| x ^. x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p15(:g rnote: ofield 'nthreads' will be initialized after field 'tidInBlock'u p), 562| | ^~~~~~~~~~~~~~~~~ t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t60i:d )note: ,field 'group' will be initialized after field 'stepSize' nthrea d562s | ( n t h rteiadd(st)i,d )t,i dnItnhBrleoacdks((tnhtrheraedaIddsx).,x )t,i dgIrnoBulpo(cgkr(otuhpr)e,a d I| d ^~~~~~~~~~~~~~~~~x .x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :g562r:o60u:p (note: gfield 'group' will be initialized after field 'stepSize'r oup), 562 | | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lgo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElementwarning: ()initializer order does not match the declaration order [-Wreorder-ctor]. run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp562: | 7 : 1 : tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested hered (tid )7, | InMtPhLr_eCaOdLsL(_nFtUhNrCe(aAdlsl)R,e dtuicdeI,n BClOoLcLkN(EtTh_rDeIaRdEICdTx,. 
xS)I,M PgLrEo,u pP(rgordo,u pu)i,n t 3| 2 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ t )| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h563: | 391 : 95 : snote: texpanded from macro 'IMPL_COLL_FUNC'e pSize (391n | c c lRSuhnmWeomr.kcS,/ sNiCzCeLo_fA(LTG)O)_ #{# a l| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o , | N group(groupC CL_PROTO_##proto>(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h):.626r:u9n:( ¬e: nin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec clShmem .626w | o r k ) ; \ p| r ^i ms(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:-15t:i dnote: Sfield 'nthreads' will be initialized after field 'tidInBlock't artS c562a | t t e r ,t indT(htrieda)d,s Snctahtrteeard,s (NnUtLhLr,e addisr)e,c tt-i>duIpn,B laorcgks(-t>hsreenaddbIudfxf.,x )a,r ggsr-o>urpe(cgvrbouufpf),, | | ^ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: :field 'group' will be initialized after field 'stepSize'202 :53: note: 562in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | t i202d | ( t i d ) , n tRhurneWaodrsk(Enltehmreenatd)(,) .grruonu(pw(eg)r;o u p| ) ^, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of 
member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uin/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht64_t:)562 : 15| :^ warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 562391 | | R utniWdo(rtkid,x .NxC)C,L _gArLoGuOp_(#g#raolugpo),, N C| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _ P| R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)O TO_## p563r | o t o > (s)t.erpuSni(z&en(cncclcSlhSmhemme.mw.ocrokm)m;. b\u f f| S ^i zes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L15_:P Rnote: Ofield 'nthreads' will be initialized after field 'tidInBlock'T O_SI M562P | L E ] / Nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hCi:Cd562L(:_t15Si:Td E)warning: P,initializer order does not match the declaration order [-Wreorder-ctor]S /nstihzr ee562oa | fd (s T( )n )tt hi{rd e( at| di ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~sd )) ,,| group(grouptn itdhIrneBaldo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hsc:(k626n(:tt9hh:r renote: eain instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heread dsI)d, x 626.t | xi )d ,I n gB rl oo uc pkp((rgtirhmorsue(pat)di,Id d- xt| .i ^~~~~~~~~~~~~~~~~xd )S,t /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hag:rr562to:Su60cp:a(t gtnote: refield 'group' will be 
initialized after field 'stepSize'or u,p )n,T h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d s| S tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)c atte r563, | N U L Ls,t edpiSriezcet(-n>cucpl,S hamregms.-c>osmemn.dbbuuffffS,i zaersg[sN-C>CrLe_cPvRbOuTfOf_,S I M| P ^L E]/NCCL_ST/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:P202S:/53s:i znote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo f(T)) 202{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group RunWorkE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:e626m:e9n:t , FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren , T, 626R | e d O p , A l gpor,i mPsr(ottiod>-(t)i.drSutna(rwteS)c;a t t| e ^r , nThrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppd:s9S:c1a:t tnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herer , NUL L9, | IdMiPrLe_cCtO-L>Lu_pF,U NaCr(gAsl-l>Rseednudcbeu,f fC,O LaLrNgEsT-_>DrIeRcEvCbTu,f fS,I M P| L ^E , Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:d202,: 53u:i nnote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here6 4_t) 202 | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:u391n:W95o:r knote: Eexpanded from macro 'IMPL_COLL_FUNC'l ementn(c),. 
rtuynp(ew,e )F;u n c| # ^# devre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppd:o10p:<1t:y pnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested here> , NC C10L | _IAMLPGLO__C#O#LaLl_gFoU,N CN(CAClLl_RPeRdOuTcOe_,# #CpOrLoLtNoE>T(_)D.IrRuEnC(T&,n cScIlMSPhLmEe,m .Pwroordk,) ;h a\l f )| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::562391::1595:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'expanded from macro 'IMPL_COLL_FUNC' 562391 | | R utniWdo(rtkid,x .NxC)C,L _gArLoGuOp_(#g#raolugpo),, N C| C ^~~~~~~~~~~~~~~~~L _PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:T562O:_60#:# pnote: rfield 'group' will be initialized after field 'stepSize'o to>( )562. | r u n ( &tnicdc(ltSihdm)e,m .nwtohrrke)a;d s\( n t| h ^r eads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :t15i:d Inote: nfield 'nthreads' will be initialized after field 'tidInBlock'B lock( t562h | r e a d Itdixd.(xt)i,d )g,r onutph(rgeraodusp()n,t h r| e ^~~~~~~~~~~a ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562&:d15i:r ewarning: cinitializer order does not match the declaration order [-Wreorder-ctor]t ->out, a562r | g s - > steindd(btuifdf),, anrtghsr-e>ardesc(vnbtuhfrfe,a d s| ) ^, tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hB:l202o:c53k:( tnote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer eadI d202x | . x ) , g r o uRpu(ngWroorukpE)l,e m e| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t < F| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), T, R e563d | O p , Asltgeop,S iPzreo(tnoc>c(l)S.hrmuenm(.wceo)m;m . 
b| u ^f fSizes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppN:C9C:L1_:P Rnote: Oin instantiation of member function 'RunWork, 2, 2>::run' requested hereT O_SI M9P | LIEM]P/LN_CCCOLL_LS_TFEUPNSC/(sAilzleRoefd(uTc)e), {C O L| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~N E T| _ group(groupD IRECT, S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hI:M666P:L9E:, note: Pin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer od, u666i | n t 6 4 _ t ) p| r^i ms(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391,: 95n:T hnote: rexpanded from macro 'IMPL_COLL_FUNC'e adsGa t391h | e r ,R udniWroerckt<-n>cucpl,F uNnUcL#L#, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, 
direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562n:B15l:o cwarning: kinitializer order does not match the declaration order [-Wreorder-ctor]( threadIdx .562x | ) , g rtoiudp((tgirdo)u,p )n,t h r| e ^~~~~~~~~~~~~~~~~a ds(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e60a:d snote: )field 'group' will be initialized after field 'stepSize', tid I562n | B l o c kt(itdh(rteiadd)I,d xn.txh)r,e agdrso(unpt(hgrreoaudps)),, t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d I n| B tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l ock(t h563r | e a d I dsxt.exp)S,i zger(onucpc(lgSrhomuepm).,c o m| m ^~~~~~~~~~~. buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElementwarning: (initializer order does not match the declaration order [-Wreorder-ctor]) .run(we )562; | | ^ tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppd:)9,: 1n:t hnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree ads( n9t | hIrMePaLd_sC)O,L Lt_iFdUINnCB(lAolclkR(etdhurceea,d ICdOxL.LxN)E,T _gDrIoRuEpC(Tg,r oSuIpM)P,L E ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~P r o| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), uin t5636 | 4 _ t ) s t| e^p Size/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:n391c:c95l:S hnote: mexpanded from macro 'IMPL_COLL_FUNC'e m.comm. 
b391u | f f SRiuzneWso[rNkC| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ N C| C group(groupL _ALGO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h#:a666l:g9o:, note: Nin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereC CL_P R666O | T O _ # # p r o tpor>i(m)s.(rtuind(,& nncTchlrSehamdesmG.awtohrekr),; d\i r e| c ^t ->up,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :N562U:L15L:, note: afield 'nthreads' will be initialized after field 'tidInBlock'r gs->s e562n | d b u f ft,i da(rtgisd-)>,r enctvhbruefafd,s ( n| t ^h reads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202t:i53d:I nnote: Bin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herel ock( t202h | r e a d I d x . xR)u,n WgorrokuEpl(egmreonutp<)F,n , | T ^~~~~~~~~~~~~~~~~, Red/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:p562,: 60A:l gnote: ofield 'group' will be initialized after field 'stepSize', Prot o562> | ( ) . 
r utni(dw(et)i;d ) ,| ^n threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppd:s9(:n1t:h rnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herea d 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in 
instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid), n562t | h r e a 
dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~g roup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ threads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ go, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ od, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eadIdx.x) ,562 | g r o u pt(igdr(otuipd)),, n| t ^~~~~~~~~~~~~~~~~h rea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562(:n60t:h rnote: efield 'group' will be initialized after field 'stepSize'a ds), 562t | i d I n Btliodc(kt(itdh)r,e andtIhdrxe.axd)s,( ngtrhoruepa(dgsr)o,u pt)i,d I n| B ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~l o c| k tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( t h563r | e a d I dsxt.exp)S,i zger(onucpc(lgSrhomuepm).,c o m| m ^~~~~~~~~~~. buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT_DIR:E562C:T15,: Swarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]M PLE, Prod, ha l562f | ) | ^t 
id(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:)391,: 95n:t hnote: rexpanded from macro 'IMPL_COLL_FUNC'e ads(n t391h | r e aRdusn)W,o rtki| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) NCCL_ A563L | G O _ # #satlegpoS,i zNeC(CnLc_cPlRSOhTmOe_m#.#cpormomt.ob>u(f)f.Sriuzne(s&[nNcCcClLS_hPmReOmT.Ow_oSrIkM)P;L E\] / N| C ^C L_STE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:S562/:s15i:z enote: ofield 'nthreads' will be initialized after field 'tidInBlock'f (T)) {562 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t| i group(groupd (tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:(666n:t9h:r enote: ain instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered s), t666i | d I n B l o c k (ptrhirmesa(dtIiddx,. xn)T,h rgeraoduspG(agtrhoeurp,) ,d i r| e ^~~~~~~~~~~~~~~~~c t->u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:,562 :N60U:L Lnote: ,field 'group' will be initialized after field 'stepSize' args- >562s | e n d b utfifd,( tairdg)s,- >nrtehcrvebaudfsf(,n t h| r ^e ads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d202I:n53B:l onote: cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herek (th r202e | a d I d x . 
x ) ,R ugnrWoourpk(Eglreomuepn)t,< F n| , ^~~~~~~~~~~ T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, 
direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hLE, P:r562o:d15,: hwarning: ainitializer order does not match the declaration order [-Wreorder-ctor]l f) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 391562: | 95 : note: expanded from macro 'IMPL_COLL_FUNC't id(tid )391, | n tRhurneWaodrsk(g,r oNuCpC)L,_ A L| G ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~O _ #| # tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a lgo, 563N | C C L _ PsRtOeTpOS_i#z#ep(rnoctcol>S(h)m.ermu.nc(o&mnmc.cbluSfhfmSeimz.ewso[rNkC)C;L _\P R O| T ^O _SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hP:L562E:]15/:N Cnote: Cfield 'nthreads' will be initialized after field 'tidInBlock'L _STE P562S | / s i z etoifd((Tt)i)d ){, n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s(nthreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d687I:n11B:l onote: cin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herek (thre a687d | I d x . 
x ) , g r opurpi(mgsr(otuipd)-,t i d| S ^~~~~~~~~~~~~~~~~t art/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:c562a:s60t:, note: nfield 'group' will be initialized after field 'stepSize'T hrea d562s | B c a s tt,i d&(dtiirde)c,t -n>tohurte,a dnsu(lnltphtrre,a dasr)g,s -t>isdeInndBbluofcfk,( tahrrgesa-d>Irdexc.vxb)u,f fg,r o u| p ^( group), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^~~~~~~~~~~: 202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllRed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:c562e:,15 :C Owarning: Linitializer order does not match the declaration order [-Wreorder-ctor]L NET_DIRE C562T | , S I MtPiLdE(,t iPdr)o,d ,n tfhlroeaatd)s ( n| t^h reads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,391 :t95i:d Inote: nexpanded from macro 'IMPL_COLL_FUNC'B lock(th r391e | a d IRduxn.Wxo)r,k n,c cNlCSChLm_eAmL.GcOo_m#m#.ablugfof,S iNzCeCsL[_NPCRCOLT_OP_R#O#TpOr_oStIoM>P(L)E.]r/uNnC(C&Ln_cScTlESPhSm/esmi.zweoorfk()T;) )\ { | ^| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: :field 'nthreads' will be initialized after field 'tidInBlock'626 :9: note: in instantiation of member function 
'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 626t | i d ( t i d ) , pnrtihmrse(atdisd(-nttihdrSetaadrst)S,c attitdeIrn,B lnoTchkr(etahdrseSacdaItdtxe.rx,) ,N UgLrLo,u pd(igrreocutp-)>,u p ,| ^~~~~~~~~~~~~~~~~a rgs->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:e562n:d60b:u fnote: ffield 'group' will be initialized after field 'stepSize', args- >562r | e c v b utfifd,( t i| d ^) , nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:s (note: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret hre a202d | s ) , t i d I nRBulnoWcokr(ktEhlreemaednItd().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:t562o:>15(:) .warning: rinitializer order does not match the declaration order [-Wreorder-ctor]u n(&ncclS h562m | e m . w otrikd)(;t i\d ) ,| ^n threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562(:n15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a ds), t562i | d I n B ltoicdk((ttihreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 
2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp::56212::151:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_ C562O | L L _ F UtNiCd((AtlildR)e,d uncteh,r eCaOdLsL(NnEtTh_rDeIaRdEsC)T,, tSiIdMIPnLBEl,o cPkr(otdh,r edaoduIbdlxe.)x ) ,| ^g rou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:(391g:r95o:u pnote: )expanded from macro 'IMPL_COLL_FUNC', | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 391 tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | Ru n563W | o r k < nsctcelpFSuinzce#(#nfcucnlcS,h mteymp.ec,o mFmu.nbcu#f#fdSeivzreesd[oNpCO,T ON_CSCILM_PALLEG]O/_N#C#CaLl_gSoT,E PNSC/CsLi_zPeRoOfT(OT_)#)# p{r o t| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~> ( )| . group(groupr un(&ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hw:o687r:k11):; note: \in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h687: | 562 : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' p r562i | m s ( t itdi-dt(itdiSdt)a,r tnBtcharseta,d sn(TnhtrheraedasdBsc)a,s tt,i d&IdniBrleocctk-(>tohurte,a dnIudlxl.pxt)r,, garrogusp-(>gsreonudpb)u,f f ,| ^~~~~~~~~~~~~~~~~a rgs/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h-:>562r:e60c:v bnote: ufield 'group' will be initialized after field 'stepSize'f f, | 562 ^ | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:(202t:i53d:) ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren thre a202d | s ( n t h r e a dRsu)n,W otrikdEIlneBmleonctk<(Ftnh,r eTa,d IRdexd.Oxp),, Aglrgoou,p (Pgrrootuop>)(,) . 
r| u ^~~~~~~~~~~n (we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:d562u:c15e:, warning: ninitializer order does not match the declaration order [-Wreorder-ctor]u llptr, &d i562r | e c t - >toiudt(,t iadr)g,s -n>tshernedabdusf(fn,t harregasd-s>)r,e ctvibduIfnfB,l o c| k ^( threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hx:.202x:)53,: gnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo up(g r202o | u p ) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ R u| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)W orkEl e563m | e n t < Fsnt,e pTS,i zRee(dnOcpc,l SAhlmgeom,. 
cPormomt.ob>u(f)f.Sriuzne(sw[eN)C;C L _| P ^R OTO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppP:L11E:]1/:N Cnote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested hereL _STE P11S | /IsMiPzLe_oCfO(LTL)_)F U{N C (| A ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l l R| e group(groupd uce, COLLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hE:T641_:D11I:R Enote: Cin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT , SIM P641L | E , P r o d , f lporaitm)s ( t| i^d -tidS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:a391r:t95R:e dnote: uexpanded from macro 'IMPL_COLL_FUNC'c e, nTh r391e | a d sRRuendWuocrek,< ndcicrleFcutn-c>#d#ofwunn,c ,& dtiyrpeec,t -F>uonuct#,# daervgrse-d>ospef,, NaCrCgLs_-A>LrGeOc_v#b#uaflfg,o , | N ^C CL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:O202_:#53#:p rnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret o>(). r202u | n ( & n c c l S hRmuenmW.owrokrEkl)e;m e\n t <| F ^n , T, Re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:O562p:,15 :A lnote: gfield 'nthreads' will be initialized after field 'tidInBlock'o , Prot o562> | ( ) . 
r utni(dw(et)i;d ) ,| ^n threads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp(:n11t:h1r:e anote: din instantiation of member function 'RunWork, 2, 2>::run' requested heres ), t i11d | IInMBPlLo_cCkO(LtLh_rFeUaNdCI(dAxl.lxR)e,d ugcreo,u pC(OgLrLoNuEpT)_,D I R| E ^~~~~~~~~~~~~~~~~C T, S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:M562P:L60E:, note: Pfield 'group' will be initialized after field 'stepSize'r od, f l562o | a t ) t| i^d (tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :n391t:h95r:e anote: dexpanded from macro 'IMPL_COLL_FUNC's (nthre a391d | s ) ,R utniWdoIrnkB, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: 
in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run'
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^ :562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562warning: :initializer order does not match the declaration order [-Wreorder-ctor]15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 562:60 :563 | note: field 'group' will be initialized after field 'stepSize' ste p562S | i z e ( ntcicdl(Sthimde)m,. 
cnotmhmr.ebaudfsf(Snitzherse[aNdCsC)L,_ PtRiOdTIOn_BSlIoMcPkL(Et]h/rNeCaCdLI_dSxT.ExP)S,/ sgirzoeuopf((gTr)o)u p{) , | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| ^~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:d562u:c15e:, warning: Cinitializer order does not match the declaration order [-Wreorder-ctor]O LLNET_DI R562E | C T , StIiMdP(LtEi,d )P,r ondt,h rdeoaudbsl(en)t h r| e^a ds),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t391i:d95I:n Bnote: lexpanded from macro 'IMPL_COLL_FUNC'o ck(thr e391a | d I dRxu.nxW)o,r kgc,l SNhCmCeLm_.AcLoGmOm_.#b#uaflfgSoi,z eNsC[CNLC_CPLR_OPTROO_T#O#_pSrIoMtPoL>E(])/.NrCuCnL(_&SnTcEcPlSS/hsmiezme.owfo(rTk))); { | \ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^ group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::15677:: 11note: :field 'nthreads' will be initialized after field 'tidInBlock' note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 677 | t i d ( t i d ) , pnrtihmrse(atdisd(-nttihdrSetaadrst)B,c atsitd,I nnBTlhorceka(dtshBrceaasdtI,d x&.dxi)r,e cgtr-o>uopu(tg,r oduipr)e,c t -| > ^~~~~~~~~~~~~~~~~d own/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562a:r60g:s -note: >field 'group' will be initialized after field 'stepSize's endbu f562f | , a r gtsi-d>(rteicdv)b,u fnft,h r e| a ^d s(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d202s:)53,: tnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered InBl o202c | k ( t h r e a d IRduxn.Wxo)r,k Eglreomuepn(tg().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement (t)i.dr(utni(dw)e,) ;n t h| r ^e ads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpph:r12e:a1d:s )note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here tidI n12B | lIoMcPkL(_tChOrLeLa_dFIUdNxC.(xA)l,l Rgerdouucpe(,g rCoOuLpL)N,E T _| D ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~I R E| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T , SIM P563L | E , P rsotde,p Sdiozueb(lnec)c l S| h^m em.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:m391m:.95b:u fnote: fexpanded from macro 'IMPL_COLL_FUNC'S izes[N C391C | L _ PRRuOnTWOo_rSkI, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hN:C626C:L9_:A Lnote: Gin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereO _##al g626o | , N C C L _ P RpOrTiOm_s#(#tpirdo-ttoi>d(S)t.arrutnS(c&antctcelrS,h mneTmh.rweoardks)S;c a\t t e| r ^, NULL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562d:i15r:e cnote: tfield 'nthreads' will be initialized after field 'tidInBlock'- >up, a562r | g s - > steindd(btuifdf),, anrtghsr-e>ardesc(vnbtuhfrfe,a d s| ) ^, tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:n202B:l53o:c knote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret hrea d202I | d x . 
x ) , g rRouunpW(ogrrkoEulpe)m,e n t| < ^~~~~~~~~~~~~~~~~F n,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :T562,: 60R:e dnote: Ofield 'group' will be initialized after field 'stepSize'p , Alg o562, | P r o ttoi>d(()t.irdu)n,( wnet)h;r e a| d ^s (/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads), tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppc:k13(:t1h:r enote: ain instantiation of member function 'RunWork, 2, 2>::run' requested hered Idx.x), g r13o | uIpM(PgLr_oCuOpL)L,_ F U| N ^~~~~~~~~~~C (AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:>562(:)15.:r uwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]( &ncclShme m562. | w o r k )t;i d\( t i| d ^) , nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:( nnote: tfield 'nthreads' will be initialized after field 'tidInBlock'h reads )562, | t i d ItniBdl(otcikd()t,h rnetahdrIedaxd.sx()n,t hgrreoaudps()g,r otuipd)I,n B l| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c k (| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h read I563d | x . x ) ,s tgerpoSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12::1562:: 15note: :in instantiation of member function 'RunWork, 2, 2>::run' requested here warning: initializer order does not match the declaration order [-Wreorder-ctor] 12 | IMPL_COLL _562F | U N C ( AtlildR(etdiudc)e,, nCtOhLrLeNaEdTs_(DnItRhErCeTa,d sS)I,M PtLiEd,I nPBrloodc,k (dtohurbelaed)I d x| .^x ), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391u:p95(:g rnote: oexpanded from macro 'IMPL_COLL_FUNC'u p), | 391 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u nWor k563< | n c c l Fsutnecp#S#ifzuen(cn,c ctlySphem,e mF.ucnocm#m#.dbeuvfrfeSdiozpeL,_ PNRCOCTLO__ASLIGMOP_L#E#]a/lNgCoC,L _NSCTCELP_SP/RsOiTzOe_o#f#(pTr)o)t o{> ( )| . ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r u n| ( group(group& ncclShmem.wor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hk:)666;: 9\: note: | in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h666: | 562 : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' pri m562s | ( t i d ,t indT(htrieda)d,s Gnatthhreera,d sd(inrtehcrte-a>dusp),, NtUiLdLI,n Balrogcsk-(>tshernedabduIfdfx,. 
xa)r,g sg-r>oruepc(vgbruofufp,) , | ^| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562::20260::53 :note: field 'group' will be initialized after field 'stepSize'note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562202 | | t i d ( tRiudn)W,o rnktEhlreemaednst(a(d)I.drxu.nx()w,e )g;r o u| p ^( group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp : 11| : ^~~~~~~~~~~1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hfield 'group' will be initialized after field 'stepSize': 562:15: warning: 562initializer order does not match the declaration order [-Wreorder-ctor] | ti d562( | t i d ) ,t indt(htrieda)d,s (nntthhrreeaaddss()n,t htriedaIdnsB)l,o ctki(dtIhnrBelaodcIkd(xt.hxr)e,a dgIrdoxu.px()g,r ogurpo)u,p ( g| r ^~~~~~~~~~~o up), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :&562d:i15r:e cwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]- >out, args- >562s | e n d b utfifd,( tairdg)s,- >nrtehcrvebaudfsf(,n t h| r ^e ads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:I53n:B lnote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herec k(t h202r | e a d I d x . 
x )R,u ngWroorukpE(lgermoeunpt)<,F n ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~T , | R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e dOp, 563A | l g o , sPtreoptSoi>z(e)(.nrcucnl(Swhem)e;m . c| o ^m m.buffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppi:z12e:s1[:N Cnote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested hereL _PRO T12O | _ISMIPMLP_LCEO]L//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hLN_CFC:UL562N_:CS15(T:AE lPwarning: lSinitializer order does not match the declaration order [-Wreorder-ctor]R/ esdiuzceeo ,f562 ( | CT O) L) L N{tE iT d_| (D ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tI iR dE| )C group(group,T ,n tShI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hrM:eP666aL:dE9,s: ( Pnnote: rtin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereoh dr,e a666dd | os u) b, l et )i d I | np^Br liomcsk((/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.htt:ih391dr:,e95 a:nd TInote: hdexpanded from macro 'IMPL_COLL_FUNC'rx e.axd)s,G a gt391rh | oe ur p,R( ugdnriWorouerpck)t<,-n >c uc| pl ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~,F u Nn| Uc tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L# L#,f u563an | rc g, s -t >systpeeenp,dS biFuzufenf(c,n# c#acdrlegSvshr-me>edrmoe.pcc,u, f fN| SC ^iC zLe_s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hA[:LN202GC:OC53_L:#_ #Pnote: aRin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herelO gT oO202,_ | S NI CM CP LL _E P] R/ ONRTCuOCn_LW#_o#SrpTkrEEPolSet/mose>in(zt)e<.oFrfnu(,nT ()T&),n c{Rc 
el dS| Oh ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pm ,e m| A. group(grouplw goor,k )P;r /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho\:t 666o :>| 9( ^:) .note: rin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu n(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hw :e666562) | :; 15 : | note: ^ field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppp:r13i :m5621s | :( t note: i in instantiation of member function 'RunWork, 2, 2>::run' requested hered ,t in13dT | (hItrMiePdaL)d_,sC GOnaLttLhh_reFerUa,Nd Csd((iAnrltelhcRtre-ed>auudcpse,), , N CUtOLiLLdL,IN nEaBTrl_goDscI-kR>(EstCehTnr,de baSudIfIMfdP,xL .Eax,r) g,Ps r-go>rdro,eu cprv(cbgcurlfo_fub,pf )l ,o| a ^ t | 1 ^~~~~~~~~~~~~~~~~6 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h): 202 :| /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h53^:: 562 :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hnote: 60:in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here:391 :note: 95field 'group' will be initialized after field 'stepSize'202: | note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | R u nRtWuiondrW(kotErilkde<)mn,ec ncntltvc(rk)e(.dtrohuprnx ,. 
x| N) ^C, C Lg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp_r:Ao12Lu:Gp1O(:_g #rnote: #oin instantiation of member function 'RunWork, 2, 2>::run' requested hereau lpg) o,12, | IN| MC ^~~~~~~~~~~PC LL__CPORLOLT_OF_U#N#Cp(rAoltloR>e(d)u.creu,n (C&OnLcLcNlESTh_mDeImR.EwCoTr,k )S;I M\P L E| , ^ Prod/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562d:o15u:b lnote: efield 'nthreads' will be initialized after field 'tidInBlock') | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :t95i:d (note: texpanded from macro 'IMPL_COLL_FUNC'i d), nt h391r | e a dRsu(nnWtohrrke),, N C| C ^~~~~~~~~~~~~~~~~L _A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:G562O:_60#:# anote: lfield 'group' will be initialized after field 'stepSize'g o, N C562C | L _ P R OtTiOd_(#t#ipdr)o,t on>t(h)r.eraudns((&nntchcrleSahdmse)m,. 
wtoirdkI)n;B l\o c k| ( ^t hrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork ,warning: initializer order does not match the declaration order [-Wreorder-ctor]N CCL_ALGO _562# | # a l g ot,i dN(CtCiLd_)P,R OnTtOh_r#e#apdrso(tnot>h(r)e.ardusn)(,& ntcicdlISnhBmleomc.kw(otrhkr)e;a d\I d x| . 
^x ), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562(:g15r:o unote: pfield 'nthreads' will be initialized after field 'tidInBlock') , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 562| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) t i563d | ( t i d )s,t enptSihzree(andcsc(lnSthhmreema.dcso)m,m .tbiudfIfnSBilzoecsk[(NtChCrLe_aPdRIOdTxO._xS)I,M PgLrEo]u/pN(CgCrLo_uSpT)E,P S /| s ^~~~~~~~~~~~~~~~~i zeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:T562):)60 :{ note: field 'group' will be initialized after field 'stepSize'| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:(666t:i9d:) ,note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren thre a666d | s ( n t h r e a dpsr)i,m st(itdiIdn,B lnoTchkr(etahdrseGaadtIhdexr.,x )d,i rgercotu-p>(ugpr,o uNpU)L,L , | a ^~~~~~~~~~~r gs->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs[:N562C:C15L:_ Pwarning: Rinitializer order does not match the declaration order [-Wreorder-ctor]O TO_SIMPLE ]562/ | N C C L _tSiTdE(PtSi/ds)i,z enotfh(rTe)a)d s{( n t| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| d group(groups ), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:o626c:k9(:t hnote: rin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree adIdx .626x | ) , g r o u p (pgrriomusp()t,i d -| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d S| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a rtSca t563t | e r , nsTtherpeSaidzseS(cnactctleSrh,m eNmU.LcLo,m md.ibruefcftS-i>zueps,[ NaCrCgLs_-P>RsOeTnOd_bSuIfMfP,L Ea]r/gNsC-C>rLe_cSvTbEuPfSf/,s i z| e ^o f(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :{202 :53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~202 | | group(group RunWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hk:E626l:e9m:e nnote: tin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here< Fn, T ,626 | R e d O p , A lpgroi,m sP(rtoitdo->t(i)d.Srtuanr(twSec)a;t t e| r ^, nTh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppr:e13a:d1s:S cnote: ain instantiation of member function 'RunWork, 2, 2>::run' requested heret ter ,13 | NIUMLPLL,_ CdOiLrLe_cFtU-N>Cu(pA,l 
laRregdsu-c>es,e nCdObLuLfNfE,T _aDrIgRsE-C>Tr,e cSvIbMuPfLfE,, P| r ^o d, rccl_b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:l202o:a53t:1 6note: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^ 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 : 95 : note: expanded from macro 'IMPL_COLL_FUNC' RunWo r391k | E l eRmuennWtou(n)c.#r#udne(vwree)d;o p <| t ^y pe>, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppA:L13G:O1_:# #note: ain instantiation of member function 'RunWork, 2, 2>::run' requested herel go, N C13C | LI_MPPRLO_TCOO_L#L#_pFrUoNtCo(>A(l)l.Rreudnu(c&en,c cClOSLhLmNeEmT._wDoIrRkE)C;T ,\ S I| M ^P LE,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :P562r:o15d:, note: rfield 'nthreads' will be initialized after field 'tidInBlock'c cl_ b562f | l o a t 1t6i)d ( t| i^d ), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r391e:a95d:s (note: nexpanded from macro 'IMPL_COLL_FUNC't hreads) ,391 | t i dRIunnBWloorckk<(ntchcrleFaudnIcd#x#.fxu)n,c ,g rtoyuppe(,g rFouunpc)#,# d e| v ^~~~~~~~~~~~~~~~~r ed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:p562<:t60y:p enote: >field 'group' will be initialized after field 'stepSize', NC C562L | _ A L G Ot_i#d#(atligdo),, NnCtChLr_ePaRdOsT(On_t#h#rperaodtso)>,( )t.irduInn(B&lnoccckl(Sthhmreema.dwIodrxk.)x;) ,\ g r| o ^u p(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:)562,: 15 :| ^~~~~~~~~~~note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, 
double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| :^562 :15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hwarning: :initializer order does not match the declaration order [-Wreorder-ctor]391 :95: note: expanded from macro 'IMPL_COLL_FUNC' 562 | t391i | d ( tRiudn)W,o rnkt ,g rNoCuCpL(_gArLoGuOp_)#,# a l| g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o , | N tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C CL_P R563O | T O _ # #sptreoptSoi>z(e)(.nrcucnl(S&hnmcecml.Schommemm..bwuofrfkS)i;z e\s [ N| C ^C L_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562S:I15M:P Lnote: Efield 'nthreads' will be initialized after field 'tidInBlock'] /NCCL _562S | T E P S /tsiidz(etoifd()T,) )n t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d s (| n group(groupt hreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :t687i:d11I:n Bnote: lin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo ck(thr e687a | d I d x . 
x ) , g rporuipm(sg(rtoiudp-)t,i d S| t ^~~~~~~~~~~~~~~~~a rtBc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:s562t:,60 :n Tnote: hfield 'group' will be initialized after field 'stepSize'r eadsB c562a | s t , &tdiidr(etcitd-)>,o untt,h rneualdlsp(tnrt,h raeragdss-)>,s etniddbIunfBfl,o cakr(gtsh-r>eraedcIvdbxu.fxf),, g| r ^o up(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:)202,: 53 :| ^~~~~~~~~~~note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->send/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hb:u562f:f15,: awarning: rinitializer order does not match the declaration order [-Wreorder-ctor]g s->recvbuff ,562 | | ^ tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,202 :n53t:h rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea ds( n202t | h r e a d s ) , RtuindWIonrBklEolcekm(etnhtr| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) . 
r| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n (we); 563| | ^ s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppt:e12p:S1i:z enote: (in instantiation of member function 'RunWork, 2, 2>::run' requested heren ccl S12h | mIeMmP.Lc_oCmOmL.Lb_uFfUfNSCi(zAelsl[RNeCdCuLc_eP,R OCTOOL_LSNIEMTP_LDEI]R/ENCCTC,L _SSITMEPPLSE/,s iPzreoodf,( Td)o)u b{l e )| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :626:9: 391note: | in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here RunWor k626< | n c c l F u n c #p#rfiumnsc(,t itdy-ptei,d SFtuanrct#S#cdaetvtreerd,o pnd,s SNcCaCtLt_eArL,G ON_U#L#La,l gdoi,r eNcCtC-L>_uPpR,O TaOr_g#s#-p>rsoetnod>b(u)f.fr,u na(r&gnsc-c>lrSehcmvebmu.fwfo,r k )| ; ^ \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::53562:: 15note: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here note: field 'nthreads' will be initialized after field 'tidInBlock' 202 | 562 | t iRdu(ntWiodr)k,E lnetmhernetax(.)x.)r,u ng(rwoeu)p;( g r| o ^u p), | ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp :12:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h1::562 :note: 60in instantiation of member function 'RunWork, 2, 2>::run' requested here: note: field 'group' will be initialized after field 'stepSize' 12 | 
I562M | P L _ C OtLiLd_(FtUiNdC)(,A lnltRherdeuacdes,( nCtOhLrLeNaEdTs_)D,I RtEiCdTI,n BSlIoMcPkL(Et,h rPeraoddI,d xd.oxu)b,l eg)r o u| p^( group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,391 : 95| : ^~~~~~~~~~~ note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation 
of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h arg:s562-:>15s:e nwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]b uff, args->recvb u562f | f , | t ^i d(tid), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:n202t:h53r:e anote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres ), ti d202I | n B l o c k ( t hRruenaWdoIrdkxE.lxe)m,e ngtr563( | ) . 
r u ns(tweep)S;i z e| ( ^n cclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cppm:.13c:o1m:m .note: bin instantiation of member function 'RunWork, 2, 2>::run' requested hereu ffSi z13e | sI[MNPCLC_LC_OPLRLO_TFOU_NSCI(MAPlLlER]e/dNuCcCeL,_ SCTOELPLSN/EsTi_zDeIoRfE(CTT),) S{I M P| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~E , | P group(groupr od, rccl_bfloat1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h6:)641 : 11| :^ note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95 :641 | note: expanded from macro 'IMPL_COLL_FUNC' 391 | p r iRmusn(Wtoirdk-pdeo>w,n ,N C&CdLi_rAeLcGtO-_>#o#uatl,g oa,r gNsC-C>Ls_ePnRdObTuOf_f#,# parrogtso->>(r)e.crvubnu(f&fn,c c l| S ^h mem.work)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h;: 202\: 53 :| ^note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' RunWo r562k | E l e m etnitd<(Ftni,d )T,, nRtehdrOepa,d sA(lngtoh,r ePardost)o,> (t)i.drIunnB(lwoec)k;( t h| r ^e adIdx.x),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp :g13r:o1u:p (note: gin instantiation of member function 'RunWork, 2, 2>::run' requested herer oup), 13 | | I ^~~~~~~~~~~~~~~~~M PL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:C562O:L60L:_ Fnote: Ufield 'group' will be initialized after field 'stepSize'N C(Al l562R | e d u c et,i dC(OtLiLdN)E,T _nDtIhRrEeCaTd,s (SnItMhPrLeEa,d sP)r,o dt,i 
drIcncBll_obcfkl(otahtr1e6a)d I d| x^. x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u391p:(95g:r onote: uexpanded from macro 'IMPL_COLL_FUNC'p ), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, null/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) 
{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement562( | ) . 
r u nt(iwde()t;i d )| , ^ nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp(:n13t:h1r:e anote: din instantiation of member function 'RunWork, 2, 2>::run' requested heres ), ti d13I | nIBMlPoLc_kC(OtLhLr_eFaUdNICd(xA.lxl)R,e dgurcoeu,p (CgOrLoLuNpE)T,_ D I| R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~E C T| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) SIMP L563E | , P r osdt,e prSciczleo>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreadsElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx940. 67 warnings generated when compiling for gfx941. 67 warnings generated when compiling for gfx90a. 67 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested 
here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, 
int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_Aeduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx803. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx1030. 
67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: 
warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7:
note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 
args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpA/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | rpgr,i m0s,( tairdg,s -n>tchornenaIdnsd,e x&,r ianrgg-s>-p>rceovn,n I&nrdienxg)-;> n e| x ^t , args->sendbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :a80r:g5s:- >note: rin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested heree cvbuff ,80 | a r g s -r>urneRdiOnpgAoctoon>n(Ianrdgesx),; a r| g ^s ->connIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:e202x:)53;: note: | in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here ^ 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h | : 80 : 5 : note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here RunW o80r | k E l e mreunntR (Parrogtso)>;( ) .| r ^u n(we); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ^202 :53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5: 1202: | note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | RIuMnPWLo_rCkOELlLe_mFeUnNtC<(FRne,d uTc,e ,R eRdIONpG,, ASlIgMoP,L EP,r oMtion>,( )u.irnutn8(_wte)) ; | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp::957:: 1note: :expanded from macro 'IMPL_COLL_FUNC' note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 3917 | | I MRPuLn_WCoOrLkL<_nFcUcNlCF(uRnecd#u#cfeu,n cR,I NtGy,p eS,I MFPuLnEc,# #Mdienv,r eudionpt<3t2y_pte)> , | N^C CL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hA:L391G:O95_:# #note: aexpanded from macro 'IMPL_COLL_FUNC'l go, NC C391L | _ P RROuTnOW_o#r#kpF(u)nc##fu.nrcu,n (t&ynpcec,l SFhumnecm#.#wdoervkr)e;d o\p , NC C| L ^_ ALGO_##a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:g562o:,15 :N Cnote: Cfield 'nthreads' will be initialized after field 'tidInBlock'L _PROTO _562# | # p r o ttoi>d(()t.irdu)n,( &nntchcrleSahdmse(mn.twhorreka)d;s )\, t| i ^d InBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:k562(:t15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd Idx.x )562, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~~~~~~~r eads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r60e:a dnote: sfield 'group' will be 
initialized after field 'stepSize') , tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~i dIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c60k:( tnote: hfield 'group' will be initialized after field 'stepSize'r eadI d562x | . x ) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~s (nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in 
instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eadIdx.x )562, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~~~~~~~r ead/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:(562n:t60h:r enote: afield 'group' will be initialized after field 'stepSize'd s), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~B l o| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)k (thre a563d | I d x . 
xs)t,e pgSriozuep((ngcrcoluSph)m,e m .| c ^~~~~~~~~~~o mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, 
SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ pe, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, 
args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t15i:d )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] nthread s562( | n t h r etaidds()t,i dt)i,d InntBhlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utph)r,e a d| I ^~~~~~~~~~~~~~~~~d x.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r60o:u pnote: (field 'group' will be initialized after field 'stepSize'g roup) ,562 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d (tid) ,563 | n t h r esatdesp(Snitzher(enacdcsl)S,h mteimd.IcnoBmlmo.cbku(ftfhSriezaedsI[dNxC.CxL)_,P RgOrToOu_pS(IgMrPoLuEp])/,N C C| L ^~~~~~~~~~~_ STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing (warning: ainitializer order does not match the declaration order [-Wreorder-ctor]r gs); | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202t:i53d:( tnote: iin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hered ), n t202h | r e a d s ( n t hRruenaWdosr)k,E lteimdeInntBr(o)u.pr)u,n ( w| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) ; | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp : 13 : 1s:t enote: pin instantiation of member function 'RunWork, 1, 2>::run' 
requested hereS ize (13n | cIcMlPSLh_mCeOmL.Lc_oFmUmN.Cb(uRfefdSuiczee,s [RNICNCGL,_ PSRIOMTPOL_ES,I MMPiLnE,] /rNcCcClL__bSfTlEoPaSt/1s6i)z e o| f^( T)) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h{: 391 :| 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: note: | expanded from macro 'IMPL_COLL_FUNC' group(group 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :R34u:n7W:o rnote: kin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here< ncclFu n34c | # # f u n c ,p rtiympse(,t iFdu,n cn#t#hdreevardesd,o p&>,p rNeCvC,L _&ArLiGnOg_-#>#naelxgto,, aNrCgCsL-_>PsReOnTdOb_u#f#fp,r oatrog>s(-)>.rreucnv(b&unfcfc,l Sahrmgesm-.>wroerdkO)p;A r\g , | 0 ^, args-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>:c562o:n15n:I nnote: dfield 'nthreads' will be initialized after field 'tidInBlock'e x, ar g562s | - > c o ntniIdn(dteixd));, n| t ^h reads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h(:n80t:h5r:e anote: din instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested heres ), t80i | d I n B lroucnkR(itnhgr((garrogusp));, | | ^ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h60::202 :note: 53field 'group' will be initialized after field 'stepSize': note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 562 | 202 | t i d ( t i d )R,u nnWtohrrkeEaldesm(enntthd(x)..xr)u,n (gwreo)u;p ( g| r ^o up), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp ^~~~~~~~~~~: 12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' 
requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 
args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when 
compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, 
FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of 
member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 
| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:4:1: note: 
in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' 
will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in 
instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member 
function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, 
direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, 
nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: 
note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 
'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreaIn file included from d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpps:)1,: In file included from t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d10I: nIn file included from B/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hl:o167c: k/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r15e:a dwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]d x.x), gr o562u | p ( g r otuipd)(,t i d| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, n| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h reads( n563t | h r e a dsst)e,p StiizdeI(nnBclcolcSkh(mtehmr.ecaodmImd.xb.uxf)f,S igzreosu[pN(CgCrLo_uPpR)O,T O _| S ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~I M P| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)E ]/NC C563L | _ S T E PsSt/espiSziezoef((nTc)c)l S{h m e| m ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~. c o| m group(groupm .buffSizes[NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:P655R:O11T:O _note: Sin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereI MPLE]/N C655C | L _ S T E P S / s i zperoifm(sT()t)i d{- t i| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S t a| r group(groupt Reduce, nThr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a626d:s9R:e dnote: uin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec e, nu l626l | p t r , & d i rpercitm-s>(otuitd,- tairdgs->sSetnadrbtuSfcfa,t taerrg,s -n>TrhercevabdusfSfc,a t t| e ^r , NULL, di/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e202c:t53-:> unote: pin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, args -202> | s e n d b u f f ,R uanrWgosr-k>Erleecmvebnutf, 2, 2>::run' requested here Prot o202> | ( ) . 
r u n ( w eR)u;n W o| r ^k Element, 2, 2>::run' requested heree dOp, 5A | lIgMoP,L _PCrOoLtLo_>F(U)N.Cr(uAnl(lwRee)d;u c e| , ^ COLLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppE:T4_:D1I:R Enote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested hereT , SI M4P | LIEM,P LM_aCxO,L Lu_iFnUtN8C_(tA)l l R| e^d uce, C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:L391L:N95E:T _note: Dexpanded from macro 'IMPL_COLL_FUNC'I RECT, S I391M | P L ER,u nMWaoxr,k c,c lNFCuCnLc_#A#LfGuOn_c#,# atlygpoe,, NFCuCnLc_#P#RdOeTvOr_e#d#oppre(>),. rNuCnC(L&_nAcLcGlOS_h#m#eaml.gwoo,r kN)C;C L\_ P R| O ^T O_##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:t562o:>15(:) .note: rfield 'nthreads' will be initialized after field 'tidInBlock'u n(&nccl S562h | m e m . wtoirdk()t;i d\) , | n ^t hrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562(:n15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a ds), 562t | i d I n Btliodc(kt(itdh)r,e andtIhdrxe.axd)s,( ngtrhoruepa(dgsr)o,u pt)i,d I n| B ^~~~~~~~~~~~~~~~~l ock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e60a:d Inote: dfield 'group' will be initialized after field 'stepSize'x .x), g562r | o u p ( gtriodu(pt)i,d ) ,| ^~~~~~~~~~~~~~~~~n thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s60(:n tnote: hfield 'group' will be initialized after field 'stepSize'r eads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~o ck(threadIdx.x), group(group), | ^~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ un(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreadMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs(nthr:e562a:d15s:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor]t idInBlock(thread I562d | x . x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eads) ,563 | t i d I nsBtleopcSki(zteh(rnecacdlISdhxm.exm).,c ogmrmo.ubpu(fgfrSoiuzpe)s,[ N C| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L _ P| R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)O TO_SI M563P | L E ] / NsCtCeLp_SSiTzEeP(Sn/cscilzSehomfe(mT.)c)o m{m . 
b| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~f f S| i group(groupz es[NCCL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hT:O687_:S11I:M Pnote: Lin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereE ]/NCCL _687S | T E P S / s i z e o fp(rTi)m)s ({t i d| - ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| S group(groupt artBcast, nThread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:B641c:a11s:t ,note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here& direct -641> | o u t , n u l l p tprr,i masr(gtsi-d>-steinddSbtuafrft,R eadrugcse-,> rneTchvrbeuafdfs,R e d| u ^c e, dire/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:t202-:>53d:o wnote: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, &di r202e | c t - > o u t , RaurngWso-r>ksEelnedmbeunftf<,F na,r gTs,- >RreedcOvpb,u fAfl,g o ,| ^P roto>()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:u202n:(53w:e )note: ;in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp : 4 : 1 :R unote: nin instantiation of member function 'RunWork, 2, 2>::run' requested hereW orkE l4e | mIeMnPtL<_FCnO,L LT_,F URNeCd(OApl,l RAeldguoc,e ,P rCoOtLoL>N(E)T._rDuInR(EwCeT),; S I| M ^P LE, M/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppa:x5,: 1i:n tnote: 8in instantiation of member function 'RunWork, 2, 2>::run' requested here_ t) | 5^ | 
IMPL_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:L391L:_95F:U Nnote: Cexpanded from macro 'IMPL_COLL_FUNC'( AllRed u391c | e , RCuOnLWLoNrEkT<_nDcIcRlEFCuTn,c #S#IfMuPnLcE,, tMyapxe,, uFiunntc8#_#td)e v r| e^d op:,95 :N Cnote: Cexpanded from macro 'IMPL_COLL_FUNC'L _ALGO_ #391# | a l gRou,n WNoCrCkL<_nPcRcOlTFOu_n#c##p#rfoutnoc>,( )t.yrpuen,( &Fnucnccl#S#hdmeevmr.ewdoorpk<)t;y p\e > ,| ^N CCL_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:G562O:_15#:# anote: lfield 'nthreads' will be initialized after field 'tidInBlock'g o, N C562C | L _ P R OtTiOd_(#t#ipdr)o,t on>t(h)r.eraudns((&nntchcrleSahdmse)m,. wtoirdkI)n;B l\o c k| ( ^t hreadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:x562.:x15):, note: gfield 'nthreads' will be initialized after field 'tidInBlock'r oup(gIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: 
in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^ :562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h15::202 :warning: 53initializer order does not match the declaration order [-Wreorder-ctor]: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 562 | t i dR(utniWdo)r,k Enltehmreenatdr(e)a.drIudnx(.wxe)),; g r| o ^u p(group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp):,6 : 
1| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ note: in instantiation of member function 'RunWork, 2, 2>::run' requested here| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 6 | I563M | P L _ C OsLtLe_pFSUiNzCe((AnlclcRleSdhumceem,. cCoOmLmL.NbEuTf_fDSIiRzEeCsT[,N CSCILM_PPLREO,T OM_aSxI,M PiLnEt]3/2N_CtC)L _ S| T^E PS/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hz:e391o:f95(:T )note: )expanded from macro 'IMPL_COLL_FUNC' { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 391 | | group(group RunWork, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren c, ty p677e | , F u n c # # d e vprreidmosp(i,d SNtCaCrLt_BAcLaGsOt_,# #naTlhgroe,a dNsCBCcLa_sPtR,O T&Od_i#r#epcrto-t>oo>u(t),. rduinr(e&cntc-c>ldSohwmne,m .awrogrsk-)>;s e\n d b| u ^f f, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:s562-:>15r:e cnote: vfield 'nthreads' will be initialized after field 'tidInBlock'b uff, 562| | ^ tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i202d:)53,: nnote: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereh read s202( | n t h r e a d sR)u,n WtoirdkIEnlBelmoecnkt(,( ) .| r ^~~~~~~~~~~~~~~~~u n(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:)562;: 60 :| ^note: field 'group' will be initialized after field 'stepSize' 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp | : 6 : 1 :t inote: din instantiation of member function 'RunWork, 2, 2>::run' requested here( tid), 6n | tIhMrPeLa_dCsO(LnLt_hFrUeNaCd(sA)l,l RteidduIcneB,l oCcOkL(LtNhErTe_aDdIIRdExC.Tx,) ,S IgMrPoLuEp,( gMraoxu,p )i,n t 3| 2 ^~~~~~~~~~~_ t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | 
prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 
0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tidPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | ), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d s(nthreads )562, | t i d ItniBdl(otcikd()t,h rnetahdrIedaxd.sx()n,t hgrreoaudps()g,r otuipd)I,n B l| o ^~~~~~~~~~~c k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :p53r:i mnote: sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here( tid, 202n | T h r e a d s G aRtuhneWro,r kdEilreemcetn-t>lsgeon,d bPurfoft,o 
>a(r)g.sr-u>nr(ewcev)b;u f f| , ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h5: | 202I:M53P:L _note: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereO LL_FU N202C | ( A l l R e d u cReu,n WCoOrLkLENlEeTm_eDnItR ( )| .^r un(we/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):;391 : 95| : ^ note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :3915 | : 1 :R unote: nin instantiation of member function 'RunWork, 2, 2>::run' requested hereW ork< n5c | cIlMFPuLn_cC#O#LfLu_nFcU,N Ct(yAplel,R eFduuncce#,# dCeOvLrLeNdEoTp_T,, NSCICMLP_LAEL,G OM_a#x#,a lugion,t 8N_CtC)L _ P| R^O TO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h#:p391r:o95t:o >note: (expanded from macro 'IMPL_COLL_FUNC') .run( &391n | c c lRSuhnmWeomr.kw(,t iNdC)C,L _nAtLhGrOe_a#d#sa(lngtoh,r eNaCdCsL)_,P RtOiTdOI_n#B#lporcokt(ot>h(r)e.arduInd(x&.nxc)c,l Sghrmoeump.(wgorroku)p;) ,\ | | ^~~~~~~~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h60::562 :note: 15field 'group' will be initialized after field 'stepSize': note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkt,h rNeCaCdLs_(AnLtGhOr_e#a#dasl)g,o ,t iNdCICnLB_lPoRcOkT(Ot_h#r#epardoItdox>.(x)).,r ugnr(o&unpc(cglrSohumpe)m,. w o| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~k ) ;| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)\ | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562s:t15e:p Snote: ifield 'nthreads' will be initialized after field 'tidInBlock'z e(ncc l562S | h m e m .tciodm(mt.ibdu)f,f Snitzherse[aNdCsC(Ln_tPhRrOeTaOd_sS)I,M PtLiEd]I/nNBClCoLc_kS(TtEhPrSe/asdiIzdexo.fx()T,) )g r{o u p| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g r o| u group(groupp ), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::562687::6011:: note: note: field 'group' will be initialized after field 'stepSize'in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 687 | t i d ( t i d )p,r inmtsh(rteiadd-st(indtShtraeratdBsc)a,s tt,i dnITnhBrleoacdks(Btcharseta,d I&ddxi.rxe)c,t -g>roouutp,( gnruolulpp)t,r , | a ^~~~~~~~~~~r gs->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:P15R:O Twarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]_ SIMPLE]/ N562C | C L _ S TtEiPdS(/tsiidz)e,o fn(tTh)r)e a{d s (| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t h r| e group(groupa ds), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hl:o641c:k11(:t hnote: rin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree adIdx .641x | ) , g r o u p ( g rporuipm)s,( t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~- t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S tartR e563d | u c e , sntTehprSeiazdes(RnecdculcSeh,m edmi.rceocmtm-.>bduofwfnS,i z&edsi[rNeCcCtL-_>PoRuOtT,O _aSrIgMsP-L>Es]e/nNdCbCuLf_fS,T EaPrSg/ss-i>zreeocfv(bTu)f)f ,{ | | ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:: 655note: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here11 : note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 202 | 655 
| R u n W o r k Eplreimmesn(tts(R)e.druucne(,w en)u;l l p| t ^r , &dir/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppe:c6t:-1>:o unote: tin instantiation of member function 'RunWork, 2, 2>::run' requested here, arg s6- | >IsMePnLd_bCuOfLfL,_ FaUrNgCs(-A>lrleRcevdbuucfef,, C O| L ^L NET_DIR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:C202T:,53 :S Inote: Min instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereP LE, M202a | x , i n t 3 2 _Rtu)n W o| r^k Eleme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t391<:F95n:, note: Texpanded from macro 'IMPL_COLL_FUNC', RedOp ,391 | A l gRou,n WPorroktc(c)l.Fruunnc(#w#ef)u;n c ,| ^t ype, F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppu:n7c:#1#:d enote: vin instantiation of member function 'RunWork, 2, 2>::run' requested herer edop <7t | yIpMeP>L,_ CNOCLCLL__FAULNGCO(_A#l#laRlegdou,c eN,C CCLO_LPLRNOETTO__D#I#RpErCoTt,o >S(I)M.PrLuEn,( &Mnacxc,l Suhimnetm3.2w_otr)k ) ;| ^\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h95::562 :note: 15expanded from macro 'IMPL_COLL_FUNC': note: field 'nthreads' will be initialized after field 'tidInBlock' 391 | 562 | R u n W otrikd<(ntcicdl)F,u nnct#h#rfeuandcs,( nttyhpree,a dFsu)n,c #t#iddeIvnrBeldoocpk<(ttyhpree>a,d INdCxC.Lx_)A,L GgOr_o#u#pa(lggroo,u pN)C,C L _| P ^~~~~~~~~~~~~~~~~R OTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:#562#:p60r:o tnote: ofield 'group' will be initialized after field 'stepSize'> ().ru n562( | & n c c ltSihdm(etmi.dw)o,r kn)t;h r\e a d| s ^( 
nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:)15,: tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd InBlo c562k | ( t h r etaiddI(dtxi.dx)),, ngtrhoruepa(dgsr(onutph)r,e a d| s ^~~~~~~~~~~) , tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d swarning: (initializer order does not match the declaration order [-Wreorder-ctor]n threads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o c k| ( tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t hread I563d | x . x ) ,s tgerpoSuipz(eg(rnocucpl)S,h m e| m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~. 
c o| m tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)m .buf f563S | i z e s [sNtCeCpLS_iPzReO(TnOc_cSlISMhPmLeEm]./cNoCmCmL._bSuTfEfPSSi/zseisz[eNoCfC(LT_)P)R O{T O _| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~I M P| L group(groupE ]/NCCL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hT:E655P:S11/:s inote: zin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree of(T)) 655{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group prims(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:-666t:i9d:S tnote: ain instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer tRedu c666e | , n T h r e a dpsrRiemdsu(ctei,d ,n unlTlhprtera,d s&Gdaitrheecrt,- >doiurte,c ta-r>gusp-,> sNeUnLdLb,u fafr,g sa-r>gsse-n>drbeucfvfb,u fafr,g s -| > ^r ecvbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202| : ^53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 202202: | 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here R u202n | W o r k E l e m eRnutnA(l)g.or,u nP(rwoet)o;> ( )| . 
^r un(we);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp : 5| : ^1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp: 55: | 1I:M Pnote: Lin instantiation of member function 'RunWork, 2, 2>::run' requested here_ COLL _5F | UINMCP(LA_lClORLeLd_uFcUeN,C (CAOlLlLRNeEdTu_cDeI,R ECCOTL,L NSEITM_PDLIER,E CMTa,x ,S IuMiPnLtE8,_ tM)a x ,| ^u int8/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:t391): 95 :| ^note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391 :39195 | : note: Rexpanded from macro 'IMPL_COLL_FUNC'u nWork< n391c | c l FRuunncW#o#rfkue,v rNeCdCoLp_#,# aNlCgCoL,_ ANLCGCOL__#P#RaOlTgOo_,# #NpCrCoLt_oP>R(O)T.Or_u#n#(p&rnoctcol>S(h)m.ermu.nw(o&rnkc)c;l S\h m e| m ^. work)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h;: 562\: 15 :| ^note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d(tid), 562n | t h r e atdisd((nttihdr)e,a dnst)h,r etaiddsI(nnBtlhorceka(dtsh)r,e atdiIddIxn.Bxl)o,c kg(rtohurpe(agdrIoduxp.)x,) , | g ^~~~~~~~~~~~~~~~~r ou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:(562g:r60o:u pnote: )field 'group' will be initialized after field 'stepSize', | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d60(:t inote: dfield 'group' will be initialized after field 'stepSize') , nthre a562d | s ( n t htrieda(dtsi)d,) ,t indtIhnrBelaodcsk((ntthhrreeaaddIsd)x,. 
xt)i,d IgnrBoluopc(kg(rtohurpe)a,d I d| x ^~~~~~~~~~~. x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15I:d xwarning: .initializer order does not match the declaration order [-Wreorder-ctor]x ), group (562g | r o u p )t,i d (| t ^~~~~~~~~~~i d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 562:15 :202 | warning: initializer order does not match the declaration order [-Wreorder-ctor] Ru n562W | o r k E lteimde(nttid(I)n.Brluonc(kw(et)h;r e a| d ^I dx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp):,6 :g1r:o unote: pin instantiation of member function 'RunWork, 2, 2>::run' requested here( grou p6) | ,I M P| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ C O| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L _FUN C563( | A l l R esdtuecpeS,i zCeO(LnLcNcElTS_hDmIeRmE.CcTo,m mS.IbMuPfLfES,i zMeasx[,N CiCnLt_3P2R_OtT)O _ S| I^M PLE]//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C391C:L95_:S Tnote: Eexpanded from macro 'IMPL_COLL_FUNC'P S/size o391f | ( T )R)u n{W o r| k ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~< n c| c group(groupl Func##fun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:,666 :t9y:p enote: ,in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here Func# #666d | e v r e d o p < tpyrpiem>s,( tNiCdC,L _nATLhGrOe_a#d#saGlagtoh,e rN,C CdLirect->up, _NPURLOLT,O _a#r#gpsr-o>tsoe>n(d)b.urfufn,( &anrcgcsl-S>hrmeecmv.bwuofrfk,) ; | \ ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::202562::5315:: note: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herefield 'nthreads' will be 
initialized after field 'tidInBlock' 202 | 562 | t i dR(utniWdo)r,k Enltehmreenatdr(e)a.drIudnx(.wxe)),; g r| o ^u p(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppu:p5):,1 : | note: ^~~~~~~~~~~~~~~~~in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :5562 | :I60M:P Lnote: _field 'group' will be initialized after field 'stepSize'C OLL_F U562N | C ( A l ltRiedd(utcied,) ,C OnLtLhNrEeTa_dDsI(RnEtChTr,e aSdIsM)P,L Et,i dMIanxB,l oucikn(tt8h_rte)a d I| d^x .x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hg:r391o:u95p:( gnote: rexpanded from macro 'IMPL_COLL_FUNC'o up), | ^~~~~~~~~~~391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:T562E:P15S:/ swarning: iinitializer order does not match the declaration order [-Wreorder-ctor]z eof(T)) {562 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t| i group(groupd (tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h626r:e9a:d snote: (in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren threa d626s | ) , t i d I n Bplroicmks((tthirde-atdiIddSxt.axr)t,S cgartotuepr(,g rnoTuhpr)e,a d s| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c a t| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e r, N U563L | L , d isrteecptS-i>zuep(,n cacrlgSsh-m>esme.ncdobmumf.fb,u fafrSgisz-e>sr[eNcCvCbLu_fPfR,O T O| _ ^S IMPLE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/:N202C:C53L:_ Snote: Tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereE PS/s i202z | e o f ( T ) ) {R u n| W ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o r k| E group(groupl ement, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereO p, Alg o677, | P r o t o > ( ) . 
rpurni(mwse()t;i d -| t ^i dStartB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppc:a6s:t1,: nnote: Tin instantiation of member function 'RunWork, 2, 2>::run' requested hereh read s6B | cIaMsPtL,_ C&OdLiLr_eFcUtN-C>(oAultl,R eddiurceec,t -C>OdLoLwNnE,T _aDrIgRsE-C>Ts,e nSdIbMuPfLfE,, aMragxs,- >irnetc3v2b_utf)f , | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h95::202 :note: 53expanded from macro 'IMPL_COLL_FUNC': note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 391 | 202 | R u n W o r k e(>),. rNuCnC(Lw_eA)L;G O _| ##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hiz:e562s:[15N:C Cwarning: Linitializer order does not match the declaration order [-Wreorder-ctor]_ PROTO_SIMP L562E | ] / N C CtLi_dS(TtEiPdS)/,s inztehorfe(aTd)s)( n{t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d s| ) group(group, tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hc:k666(:t9h:r enote: ain instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered Idx. 
x666) | , g r o u p ( gprroiumps)(,t i d| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ n T| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eadsG a563t | h e r , sdtierpeScitz-e>(unpc,c lNSUhLmLe,m .acrogmsm-.>bsuefnfdSbiuzfefs,[ NaCrCgLs_-P>RrOeTcOv_bSuIfMfP,L E ]| / ^N CCL_STE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:S202/:s53i:z enote: oin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heref (T)) 202{ | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group RunWorkEl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:m666e:n9t:< Fnote: nin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, T, R e666d | O p , A l g o ,p rPirmost(ot>i(d),. rnuTnh(rweea)d; | ^ sGa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppt:h6e:r1,: dnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested herer ect- >6u | pI,M PNLU_LCLO,L La_rFgUsN-C>(sAelnldRbeudfufc,e ,a rCgOsL-L>NrEeTc_vDbIuRfEfC,T , | S ^I MPLE, Max/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202i:n53t:3 2note: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret ) | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 : note: Rexpanded from macro 'IMPL_COLL_FUNC'u nWork E391l | e m eRnutn (F)u.nrcu#n#(dweev)r;e d o| p ^< type>,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :N6C:C1L:_ Anote: Lin instantiation of member function 'RunWork, 2, 2>::run' requested hereG O_## a6l | gIoM,P LN_CCCOLL_LP_RFOUTNOC_(#A#lplrRoetdou>c(e),. 
rCuOnL(L&NnEcTc_lDSIhRmEeCmT.,w oSrIkM)P;L E\, M| a ^x , in/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:35622:_15t:) note: field 'nthreads' will be initialized after field 'tidInBlock'| ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :t95i:d (note: texpanded from macro 'IMPL_COLL_FUNC'i d), nth r391e | a d sR(unntWhorreka,, N| C ^~~~~~~~~~~~~~~~~C L_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:L562G:O60_:# #note: afield 'group' will be initialized after field 'stepSize'l go, N562C | C L _ P RtOiTdO(_t#i#dp)r,o tnot>h(r)e.ardusn((n&tnhcrcelaSdhsm)e,m .twiodrIkn)B;l o\c k (| t ^h readId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)15,: gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up(gro u562p | ) , | t ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | i tdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 60 :s tnote: efield 'group' will be initialized after field 'stepSize'p Size(nc c562l | S h m e mt.icdo(mtmi.db)u,f fnStihzreesa[dNsC(CnLt_hPrReOaTdOs_)S,I MtPiLdEI]n/BNlCoCcLk_(StThErPeSa/dsIidzxe.oxf)(,T )g)r o{u p (| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r o u| p group(group) , | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grMax, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL_CO:L562L:_15F:U 
Nwarning: Cinitializer order does not match the declaration order [-Wreorder-ctor]( AllReduce, COL L562N | E T _ D ItRiEdC(Tt,i dS)I,M PnLtEh,r eMaadxs,( nitnhtr3e2a_dts)) , | t^i dInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:c391k:(95t:h rnote: eexpanded from macro 'IMPL_COLL_FUNC'a dIdx.x )391, | g rRouunpW(ogrrko.,c oNmCmC.Lb_uAfLfGSOi_z#e#sa[lNgCoC,L _NPCRCOLT_OP_RSOITMOP_L#E#]p/rNoCtCoL>_(S)T.ErPuSn/(s&inzcecolfS(hTm)e)m .{w o r| k ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) ; | \ group(group | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: :field 'nthreads' will be initialized after field 'tidInBlock'655 :11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here562 | t655i | d ( t i d ) , n t hprreiamdss((tnitdh-rteiaddSst)a,r ttRieddIuncBel,o cnkT(htrheraedasdRIeddxu.cxe),, ngurloluppt(rg,r o&udpi)r,e c t| - ^~~~~~~~~~~~~~~~~> out,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :a562r:g60s:- >note: sfield 'group' will be initialized after field 'stepSize'e ndbuf f562, | a r g st-i>dr(etcivdb)u,f fn,t h r| e ^a ds(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:)202,: 53t:i dnote: Iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren Block (202t | h r e a d I d x .Rxu)n,W ogrrkoEulpe(mgernotu().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prim/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:T562O:_15#:# pwarning: rinitializer order does not match the declaration order [-Wreorder-ctor]o to>().ru n562( | & n c c ltSihdm(etmi.dw)o,r kn)t;h r\e a d| s ^( nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:)15,: tnote: ifield 'nthreads' will be 
initialized after field 'tidInBlock'd InBlo c562k | ( t h r etaiddI(dtxi.dx)),, ngtrhoruepa(dgsr(onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~) , | t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)i dInBl o563c | k ( t h rsetaedpISdixz.ex()n,c cglrSohumpe(mg.rcooumpm).,b u f| f ^~~~~~~~~~~~~~~~~S ize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:[562N:C60C:L _note: Pfield 'group' will be initialized after field 'stepSize'R OTO_ S562I | M P L E ]t/iNdC(CtLi_dS)T,E PnSt/hsriezaedosf((nTt)h)r e{a d s| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, t| i group(groupd InBlock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:e687a:d11I:d xnote: .in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herex ), gr o687u | p ( g r o u p ) , p| r ^~~~~~~~~~~i ms(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 
| RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: 562warning: | initializer order does not match the declaration order [-Wreorder-ctor] tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~~~~~~~g rou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:)562,: 60 :| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~note: field 'group' will 
be initialized after field 'stepSize' | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i d (sttiedp)S,i znet(hnrcecaldSsh(mnetmh.rceoamdms.)b,u ftfiSdiIzneBsl[oNcCkC(Lt_hPrReOaTdOI_dSxI.MxP)L,E ]g/rNoCuCpL(_gSrToEuPpS)/,s i z| e ^~~~~~~~~~~o f(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:(562T:)15): {warning: initializer order does not match the declaration order [-Wreorder-ctor] | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d626):,9 :n tnote: hin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eads( n626t | h r e a d s ) , ptriidmIsn(Btliodc-kt(itdhSrteaardtISdcxa.txt)e,r ,g rnoTuhpr(egardosuSpc)a,t t e| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, N| U tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L L, di r563e | c t - > uspt,e paSrigzse-(>nscecnldSbhumfefm,. caormgms.-b>urfefcSvibzuefsf[,N C C| L ^_ PROTO_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hM:P202L:E53]:/ Nnote: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereC L_ST E202P | S / s i z e o f (RTu)n)W o{r k E| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e m e| n group(groupt , FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo , Pr o666t | o > ( ) . 
r u n (pwrei)m;s ( t| i ^d , nThr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppe:a7d:s1G:a tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested heree r, d i7r | eIcMtP-L>_uCpO,L LN_UFLULN,C (aArlglsR-e>dsuecned,b uCfOfL,L NaErTg_sD-I>RrEeCcTv,b uSfIfM,P L E| , ^ Max, u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:n202t:3532:_ tnote: )in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 : note: Rexpanded from macro 'IMPL_COLL_FUNC'u nWork E391l | e m eRnutn#d(#()dt.erivudrn)e(,dw oenp)t<;ht ry ep| ae ^d> s,( nNtChCrL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppe_:aA6dL:sG1)O:,_ #note: t#in instantiation of member function 'RunWork, 2, 2>::run' requested hereia dlIgno B,6l | oNIcCMkCP(LLt__hPCrROeOLaTLdO_I_Fd#Ux#N.pCxr()oA,tl olg>Rr(eo)du.uprc(ueg,nr (oC&uOnpLc)Lc,Nl ES Th| _m ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~De Im R.| Ew tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)Co Tr,k ) S;563I | M\ P L E| , ^s tMeapxS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,i: z562ie:n(15tn:3c 2cnote: _lfield 'nthreads' will be initialized after field 'tidInBlock'tS )h m e | m562^. 
| c o m m/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h .:tb391iu:df95(f:tS iinote: dzexpanded from macro 'IMPL_COLL_FUNC')e ,s [nNtChC rL391e_ | aP dR sOR(TunOnt_WhSorIreMkaP,:, 666 :N| 9C ^~~~~~~~~~~~~~~~~:C Lnote: _in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA :L562G:O60 _:666# | #note: afield 'group' will be initialized after field 'stepSize' l g o , 562N | Cp Cr Li _m PstR(iOtdTi(Odt_,i# d#n)pT,rh orntetoah>dr(se)Ga.adrtsuh(nen(rt&,hn rcdecialrdSeshc)mt,e- m>t.uiwpdo,Ir nkNB)Ul;Lo Lc\,k ( at| rh ^gr se-a>ds/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hIe:dn562xd:.b15xu:)f ,fnote: ,field 'nthreads' will be initialized after field 'tidInBlock'g raorugp s(562-g | >r ro eu cp v)tb,iu df (f| t, ^~~~~~~~~~~ i| d ^) , nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d202s:(53n:t hnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree ads) ,202 | t i d I n B l o cRku(ntWhorrekaEdlIedmxe.nxt)<,F ng,r oTu,p (RgerdoOupp,) ,A l g| o ^~~~~~~~~~~~~~~~~, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:t562o:>60(:) .note: rfield 'group' will be initialized after field 'stepSize'u n(we )562; | | ^ tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppi:d7):,1 :n tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested herer eads (7n | tIhMrPeLa_dCsO)L,L _tFiUdNICn(BAllolcRke(dtuhcree,a dCIOdLxL.NxE)T,_ DgIrRoEuCpT(,g rSoIuMpP)L,E , | M ^~~~~~~~~~~a x, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h15::562 :warning: 15initializer order does not match the declaration order [-Wreorder-ctor]: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~~~~~~~) , | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)60 : note: field 'group' will be initialized after field 'stepSize' 563 | 562s | t e p S itzied((ntcicdl)S,h mnetmh.rceoamdms.(bnutfhfrSeiazdess)[,N CtCiLd_IPnRBOlToOc_kS(ItMhPrLeEa]d/INdCxC.Lx_)S,T EgPrSo/uspi(zgeroofu(pT)),) {| ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ d), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roup), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group rk); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 
1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs(nt:h562r:e15a:d swarning: )initializer order does not match the 
declaration order [-Wreorder-ctor], tidInBlock (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~d s)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562t:i60d:I nnote: Bfield 'group' will be initialized after field 'stepSize'l ock(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , ti d563I | n B l o cskt(etphSriezaed(Indcxc.lxS)h,m egmr.ocuopm(mg.rbouufpf)S,i z e| s ^~~~~~~~~~~[ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->send/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hb:u562f:f15,: awarning: rinitializer order does not match the declaration order [-Wreorder-ctor]g s->recv b562u | f f , t| i ^d (tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h202r:e53a:d snote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren thre a202d | s ) , t i d I nRBulnoWcokr(ktEhlreemaednItd tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( ).ru n563( | w e ) ; s t| e ^p 
Size(nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppc:l7S:h1m:e mnote: .in instantiation of member function 'RunWork, 2, 2>::run' requested herec omm. b7u | fIfMSPiLz_eCsO[LNLC_CFLU_NPCR(OATlOl_RSeIdMuPcLeE,] /CNOCLCLLN_ESTT_EDPISR/EsCiTz,e oSfI(MTP)L)E ,{ M a| x ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, u| i group(groupn t32_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :| 687^: 11: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 391:95: note: 687expanded from macro 'IMPL_COLL_FUNC' | 391 | pRruinmWso(rtkieo>u,t ,N CnCuLl_lApLtGrO,_ #a#raglsg-o>,s eNnCdCbLu_fPfR,O TaOr_g#s#-p>rroetcov>b(u)f.fr,u n (| & ^n cclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:.202w:o53r:k )note: ;in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here \ | ^202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :R15u:n Wnote: ofield 'nthreads' will be initialized after field 'tidInBlock'r kEle m562e | n t < F nt,i dT(,t iRde)d,O pn,t hArlegaod,s (Pnrtohtroe>a(d)s.)r,u nt(iwdeI)n;B l o| c ^k (threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppd:I7d:x1.:x )note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here grou p7( | gIrMoPuLp_)C,O L L| _ ^~~~~~~~~~~~~~~~~F UNC(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:l562l:R60e:d unote: cfield 'group' will be initialized after field 'stepSize'e , COL L562N | E T _ D ItRiEdC(Tt,i dS)I,M PnLtEh,r eMaadxs,( nutihnrte3a2d_st)), t| i^d 
InBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:c391k:(95t:h rnote: eexpanded from macro 'IMPL_COLL_FUNC'a dIdx. x391) | , gRruonuWpo(rgkr, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 
| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ .buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, 
&direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s15):, warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i dInBlock (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d s )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid I563n | B l o c ks(ttehprSeiazdeI(dnxc.cxl)S,h mgermo.ucpo(mgmr.obuupf)f,S i z| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s [ N| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C L_PRO T563O | _ S I M PsLtEe]p/SNiCzCeL(_nScTcElPSSh/mseimz.ecoofm(mT.)b)u f{f S i| z ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e s [| N group(groupC CL_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hS:I641M:P11L:E ]note: /in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereN CCL_S T641E | P S / s i z e o f ( Tp)r)i m{s ( t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d - t| i group(groupd StartRe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:u626c:e9,: nnote: Tin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereh reads R626e | d u c e , d i rpercitm-s>(dtoiwdn-,t i&ddSitraercttS-c>aotutte,r ,a rngTsh-r>esaednsdSbcuaftft,e ra,r gNsU-L>Lr,e cdvibruefcft,- > u| p ^, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h-:>202s:e53n:d bnote: 
uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heref f, a202r | g s - > r e c v bRuufnfW,o r k| E ^l ement, 2, 2>::run' requested heree dOp, 202A | l g o , P r o tRou>n(W)o.rrkuEnl(ewmee)n;t < F| n ^, T, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppe:d9O:p1,: Anote: lin instantiation of member function 'RunWork, 2, 2>::run' requested hereg o, P9r | oItMoP>L(_)C.OrLuLn_(FwUeN)C;( A l| l ^R educe, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppC:O7L:L1N:E Tnote: _in instantiation of member function 'RunWork, 2, 2>::run' requested hereD IREC T7, | ISMIPMLP_LCEO,L LM_aFxU,N Cu(iAnltl6R4e_dtu)c e ,| ^C OLLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hE:T391_:D95I:R Enote: Cexpanded from macro 'IMPL_COLL_FUNC'T , SIM P391L | E , RMuanxW,o rukin,c cNlCFCuLn_cA#L#GfOu_n#c#,a ltgyop,e NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562::39115::95 :warning: initializer order does not match the declaration order [-Wreorder-ctor]note: expanded from macro 'IMPL_COLL_FUNC' 391562 | | R u ntWiodr(ktI,d xN.CxC)L,_ AgLrGoOu_p#(#garloguop,) ,N C C| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ P R| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)T O_##pr o563t | o > ( ) 
.srtuenp(S&ize(nnccccllSShhmmeemm..wcoormkm).;b u\f f S| i ^z es[NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:P15R:O Tnote: Ofield 'nthreads' will be initialized after field 'tidInBlock'_ SIMP L562E | ] / N C CtLi_dS(TtEiPdS)/,s inztehorfe(aTd)s)( n{t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d s| ) group(group, tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:c666k:(9t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea dIdx. x666) | , g r o u p ( gprroiumps)(,t i d| , ^~~~~~~~~~~~~~~~~ nTh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d60s:G anote: tfield 'group' will be initialized after field 'stepSize'h er, d562i | r e c t -t>iudp(,t iNdU)L,L ,n tahrrgesa-d>ss(enntdhbruefafd,s )a,r gtsi-d>IrneBclvobcukf(ft,h r e| a ^d Idx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 202g:r53o:u pnote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereg roup )202, | | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: 
note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested 
here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562::15562:: 15warning: :initializer order does not match the declaration order [-Wreorder-ctor] note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :563562 | : 60 : note: sfield 'group' will be initialized after field 'stepSize't epSi z562e | ( n c c ltSihdm(etmi.dc)o,m mn.tbhurfefaSdisz(enst[hNrCeCaLd_sP)R,O TtOi_dSIInMBPlLoEc]k/(NtChCrLe_aSdTIEdPxS./xs)i,z egorfo(uTp)()g r{o u p| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, | | group(group ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMP note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ LE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation 
of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclSh 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ork); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of 
member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, 
nThread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(nccsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Sizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:( nwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]h reads), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~B l o| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)k (thre a563d | I d x . xs)t,e pgSriozuep((ngcrcoluSph)m,e m .| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o m m| . 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)b uffSi z563e | s [ N C CsLt_ePpRSOiTzOe_(SnIcMcPlLSEh]m/eNmC.CcLo_mSmT.EbPuSf/fsSiizzeeosf[(NTC))C L{_ P R| O ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~T O _| S group(groupI MPLE]/NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hL:_687S:T11E:P Snote: /in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres izeof( T687) | ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupp rims(tid-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i655d:S11t:a rnote: tin instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereB cast, n655T | h r e a d s B c a s tp,r i&mdsi(rteicdt--t>ioduStt,a rntuRleldputcre,, anrTghsr-e>asdesnRdebduufcfe,, anruglsl-p>trre,c v&bduifrfe,c t -| > ^o ut, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h-:>202s:e53n:d bnote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heref f, a r202g | s - > r e c v b uRfufn,W o r| k ^E lement, 2, 2>::run' requested hered Op, A202l | g o , P r o t oR>u(n)W.orruknE(lweem)e;n t <| F ^n , T, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppe:d9O:p1,: Anote: lin instantiation of member function 'RunWork, 2, 2>::run' requested hereg o, P r9o | tIoM>P(L)_.CrOuLnL(_wFeU)N;C ( A| l ^l Reduce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :C10O:L1L:N Enote: Tin instantiation of member function 'RunWork, 2, 2>::run' requested here_ DIRE C10T | ,I MSPILM_PCLOEL,L _MFaUxN,C (uAilnltR6e4d_utc)e , | 
C^O LLNE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:_391D:I95R:E Cnote: Texpanded from macro 'IMPL_COLL_FUNC', SIMP L391E | , MRauxn,W ohraklu,n cN#C#CfLu_nAcL,G Ot_y#p#ea,l gFou,n cN#C#CdLe_vPrReOdToOp_<#t#yppreo>t,o >N(C)C.Lr_uAnL(G&On_c#c#laSlhgmoe,m .NwCoCrLk_)P;R O\T O _| # ^# proto/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h>:(562):.15r:u nnote: (field 'nthreads' will be initialized after field 'tidInBlock'& ncclS h562m | e m . w otrikd)(;t i\d ) ,| ^n threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562(:n15t:h rnote: efield 'nthreads' will be initialized after field 'tidInBlock'a ds), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~~~~~~~B loc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:(562t:h60r:e anote: dfield 'group' will be initialized after field 'stepSize'I dx.x) ,562 | g r o u pt(igdr(otuipd)),, n| t ^~~~~~~~~~~~~~~~~h rea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:s562(:n60t:h rnote: efield 'group' will be initialized after field 'stepSize'a ds), t562i | d I n B ltoicdk((ttihdr)e,a dnItdhxr.exa)d,s (gnrtohurpe(agdrso)u,p )t,i d I| n ^~~~~~~~~~~B lock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run'
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ func, type, 
Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563 | t i ds(tteipdS)i,z en(tnhcrcelaSdhsm(enmt.hcroemamd.sb)u,f ftSiidzIensB[lNoCcCkL(_tPhRrOeTaOd_ISdIxM.PxL)E,] /gNrCoCuLp_(SgTrEoPuSp/)s,i z e| o ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~f ( T| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~563 | | group(group stepSiz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:(626n:c9c:l Snote: hin instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herem em.co m626m | . 
b u f f S i z epsr[iNmCsC(Lt_iPdR-OtTiOd_SStIaMrPtLSEc]a/tNtCeCrL,_ SnTTEhPrSe/asdiszSecoaft(tTe)r), {N U L| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, d| i group(groupr ect->up, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ha:r666g:s9-:> snote: ein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren dbuff ,666 | a r g s - > r e cpvrbiumfsf(,t i d| , ^ nThrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s202G:a53t:h enote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, dir e202c | t - > u p , N URLuLn,W oarrkgEsl-e>mseenntdpr,e cAvlbguof,f ,P r o| t ^o >().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:w202e:)53;: note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp : 10 : 1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereR unWo r10k | EIlMePmLe_nCtO_(D)I.RrEuCnT(,w eS)I;M P L| E ^, Max, h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppa:l9f:)1 : | note: ^in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :9391 | :I95M:P Lnote: _expanded from macro 'IMPL_COLL_FUNC'C OLL_FU N391C | ( A lRluRneWdourcke<,n cCcOlLFLuNnEcT#_#DfIuRnEcC,T ,t ySpIeM,P LFEu,n cM#a#xd,e vuriendto6p4<_tty)p e >| ,^ NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:A391L:G95O:_ #note: #expanded from macro 'IMPL_COLL_FUNC'a lgo, N391C | C L _RPuRnOWToOr_k#<#npcrcoltFou>n(c)#.#rfuunn(c&,n ctcylpSeh,m eFmu.nwco#r#kd)e;v r\e d o| 
p ^< type>,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :N562C:C15L:_ Anote: Lfield 'nthreads' will be initialized after field 'tidInBlock'G O_##a l562g | o , N CtCiLd_(PtRiOdT)O,_ #n#tphrroetaod>s(()n.trhurne(a&dnsc)c,l SthimdeImn.Bwloorckk)(;t h\r e a| d ^I dx.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r15o:u pnote: (field 'nthreads' will be initialized after field 'tidInBlock'g roup) ,562 | | ^~~~~~~~~~~~~~~~~ tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562i:d60):, note: nfield 'group' will be initialized after field 'stepSize't hread s562( | n t h r etaidds()t,i dt)i,d InntBhlroecakd(st(hnrtehardeIaddxs.)x,) ,t igdrIonuBpl(ogcrko(utph)r,e a d| I ^~~~~~~~~~~~~~~~~d x.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562g:r60o: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, 
&direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ orkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 
7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ -tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ cclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ unc##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:(15g:r owarning: uinitializer order does not match the declaration order [-Wreorder-ctor]p ), | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60t:i dnote: (field 'group' will be initialized after field 'stepSize't id), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ g r| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u p(g r563o | u p ) , s t| e ^~~~~~~~~~~p Size(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, 
float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:B562l:o15c:k (warning: tinitializer order does not match the declaration order [-Wreorder-ctor]h readIdx .562x | ) , g rtoiudp((tgirdo)u,p )n,t h r| e ^~~~~~~~~~~~~~~~~a ds(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:s )note: ,field 'group' will be initialized after field 'stepSize' tidIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n Bloc k563( | t h r e asdtIedpxS.ixz)e,( ngcrcoluSph(mgermo.ucpo)m,m . 
b| u ^~~~~~~~~~~f fSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:I15n:B lwarning: oinitializer order does not match the declaration order [-Wreorder-ctor]c k(threadI d562x | . x ) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~~~~~~~s (nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d60s:) ,note: field 'group' will be initialized after field 'stepSize't idInB l562o | c k ( t htrieda(dtIiddx).,x )n,t hgrreoaudps((gnrtohurpe)a,d s )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I nBloc k563( | t h r e asdtIedpxS.ixz)e,( ngcrcoluSph(mgermo.ucpo)m,m . 
b| u ^~~~~~~~~~~f fSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 
626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:L562L:N15E:T _warning: Dinitializer order does not match the declaration order [-Wreorder-ctor]I RECT, SI M562P | L E , Mtaixd,( thiadl)f,) n t| h^r 
eads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:n391t:h95r:e anote: dexpanded from macro 'IMPL_COLL_FUNC's ), tid I391n | B l oRcukn(Wtohrrke ,s tNeCpCSLi_zAeL(GnOc_c#l#Sahlmgeom,. cNoCmCmL._bPuRfOfTSOi_z#e#sp[rNoCtCoL>_(P)R.OrTuOn_(S&InMcPcLlES]h/mNeCmC.Lw_oSrTkE)P;S /\s i z| e ^o f(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :{562 : 15| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock'| group(group 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 641t:i11d:( tnote: iin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered ), nth r641e | a d s ( n t h r e a dpsr)i,m st(itdiIdn-BtliodcSkt(atrhtrReeadduIcdex,. xn)T,h rgeraoduspR(egdruocuep,) ,d i r| e ^~~~~~~~~~~~~~~~~c t->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:o562w:n60,: ¬e: dfield 'group' will be initialized after field 'stepSize'i rect -562> | o u t , tairdg(st-i>ds)e,n dnbtuhfrfe,a dasr(gnst-h>rreeacdvsb)u,f ft,i d I| n ^B lock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:I dnote: xin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here. 
x), g202r | o u p ( g r o u pR)u,n W o| r ^~~~~~~~~~~k Element().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 
'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidSta(rgtrRoeudpu)c,e , | n ^~~~~~~~~~~T hreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hads(:n562t:h15r:e awarning: dinitializer order does not match the declaration order [-Wreorder-ctor]s ), tidInBlock (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d s )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tidInB l563o | c k ( t hsrteeapdSIidzxe.(xn)c,c lgSrhomuepm(.gcroomump.)b,u f f| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i z e| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)[ NCCL_ P563R | O T O _ SsItMePpLSEi]z/eN(CnCcLc_lSSThEmPeSm/.sciozmemo.fb(uTf)f)S i{z e s| [ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~N C C| L group(group_ PROTO_SIMPLE]/NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hL:_666S:T9E:P Snote: /in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres izeof( T666) | ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ p| r group(groupi ms(tid, nThreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hG:a655t:h11e:r ,note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered irect -655> | u p , N U L L , aprrgism-s>(steindd-btuifdfS,t aarrtgRse-d>urceec,v bnuTfhfr,e a d| s ^R educe, n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:l202l:p53t:r ,note: in 
instantiation of member function 'RunWorkElement, 2, 2>::run' requested here& dire c202t | - > o u t , a rRgusn-W>osreknEdlbeumfefn,t ,r eRcevdbOupf,f ,A l g| o ^, Prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:>202(:)53.:r unote: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here( we) ;202 | | ^ Ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppn:W10o:r1k:E lnote: ein instantiation of member function 'RunWork, 2, 2>::run' requested herem entR(e)d.urcuen,( wCeO)L;L N E| T ^_ DIRECT, SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp,: 12M:a1x:, note: hin instantiation of member function 'RunWork, 2, 2>::run' requested herea lf) 12| | ^I MPL_C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:L391L:_95F:U Nnote: Cexpanded from macro 'IMPL_COLL_FUNC'( AllRed u391c | e , RCuOnLWLoNrEkT<_nDcIcRlEFCuTn,c #S#IfMuPnLcE,, tMyapxe,, dFouunbcl#e#)d e v| r^e dop95,: Nnote: Cexpanded from macro 'IMPL_COLL_FUNC'C L_ALGO _391# | # a lRguon,W oNrCkC (t)y.preu,n (F&unnccc#l#Sdhemverme.dwoopr , | N ^C CL_ALG/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562#:#15a:l gnote: ofield 'nthreads' will be initialized after field 'tidInBlock', NCCL _562P | R O T O _t#i#dp(rtoitdo)>,( )n.trhurne(a&dnsc(cnltShhrmeeamd.sw)o,r kt)i;d I\n B l| o ^c k(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562I:d15x:. 
xnote: )field 'nthreads' will be initialized after field 'tidInBlock', group (562g | r o u p )t,i d (| t ^~~~~~~~~~~~~~~~~i d), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r60e:a dnote: sfield 'group' will be initialized after field 'stepSize'( nthre a562d | s ) , ttiiddI(ntBildo)c,k (ntthhrreeaaddIsd(xn.txh)r,e agdrso)u,p (tgirdoIunpB)l,o c k| ( ^~~~~~~~~~~~~~~~~t hread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:d562x:.60x:) ,note: field 'group' will be initialized after field 'stepSize'g roup(gro u562p | ) , | t ^~~~~~~~~~~i d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nul/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:p562t:r15,: awarning: rinitializer order does not match the declaration order [-Wreorder-ctor]g s->send b562u | f f , atrigds(-t>irde)c,v bnutfhfr,e a d| s ^( nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t202i:d53I:n Bnote: lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo ck(th r202e | a d I d x . x ) ,R ugnrWoourpk(Eglreomuepn)t,< F n| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ T ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)R edOp ,563 | A l g o ,s tPerpoStioz>e(()n.crculnS(hwmee)m;. 
c o| m ^m .buffSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppz:e10s:[1N:C Cnote: Lin instantiation of member function 'RunWork, 2, 2>::run' requested here_ PROTO _10S | IIMMPPLLE_]C/ONLCLC_LF_USNTCE(PASl/lsRiezdeuocfe(,T )C)O L{L N E| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ D I| R group(groupE CT, SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hL:E641,: 11M:a xnote: ,in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here half) 641| | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391p:r95i:m snote: (expanded from macro 'IMPL_COLL_FUNC't id-tidSt a391r | t R eRduuncWeo,r knFduonwcn#,# d&edvirreedcotp-<>toyupte,> ,a rNgCsC-L>_sAeLnGdOb_u#f#fa,l gaor,g sN-C>CrLe_cPvRbOuTfOf_,# # p| r ^o to>()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:r202u:n53(:& nnote: cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herec lShm e202m | . 
w o r k ) ; \R u n| W ^o rkElem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:n562t:<15F:n ,note: field 'nthreads' will be initialized after field 'tidInBlock'T , RedO p562, | A l g ot,i dP(rtoitdo)>,( )n.trhurne(awdes)(;n t h| r ^e ads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :t10i:d1I:n Bnote: lin instantiation of member function 'RunWork, 2, 2>::run' requested hereo ck( t10h | rIeMaPdLI_dCxO.LxL)_,F UgNrCo(uApl(lgRreoduupc)e,, C| O ^~~~~~~~~~~~~~~~~L LNE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:_562D:I60R:E Cnote: Tfield 'group' will be initialized after field 'stepSize', SIMPL E562, | M a x ,t ihda(ltfi)d ) ,| ^n thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d391s:(95n:t hnote: rexpanded from macro 'IMPL_COLL_FUNC'e ads), t391i | d I nRBulnoWcokr(kt, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(nth:r562e:a15d:s )warning: ,initializer order does not match the declaration order [-Wreorder-ctor] tidInBlock(threadIdx. 
x562) | , g r otuipd((gtriodu)p,) ,n t h| r ^~~~~~~~~~~~~~~~~e ads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :t60i:d Inote: nfield 'group' will be initialized after field 'stepSize'B lock(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , tidI n563B | l o c k (sttherpeSaidzIed(xn.cxc)l,S hgmreomu.pc(ogmrmo.ubpu)f,f S i| z ^~~~~~~~~~~e s[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): 562:15: 563warning: | initializer order does not match the declaration order [-Wreorder-ctor] stepSi z562e | ( n c c ltSihdm(etmi.dc)o,m mn.tbhurfefaSdisz(enst[hNrCeCaLd_sP)R,O TtOi_dSIInMBPlLoEc]k/(NtChCrLe_aSdTIEdPxS./xs)i,z egorfo(uTp)()g r{o u p| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, | | group(group ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 687563: | 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres tepS i687z | e ( n c c l S h m e mp.rciommsm(.tbiudf-ftSiidzSetsa[rNtCBCcLa_sPtR,O TnOT_hSrIeMaPdLsEB]c/aNsCtC,L _&SdTiErPeSc/ts-i>zoeuotf,( Tn)u)l l{p t r| , 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ a r| g group(groups ->sendbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 677a:r11g:s -note: >in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer ecvbuff ,677 | | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :p202r:i53m:s (note: tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herei d-t i202d | S t a r t B c a sRtu,n WnoTrhkrEelaedmseBnct().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(thre tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ adIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, 
&direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in 
instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :t562i:d15I:n Bwarning: linitializer order does not match the declaration order [-Wreorder-ctor]o ck(thread I562d | x . 
x ) ,t igdr(otuipd()g,r onutph)r,e a d| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eads), 563t | i d I n BsltoecpkS(itzher(enacdcIldSxh.mxe)m,. cgormomu.pb(ugfrfoSuipz)e,s [ N| C ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C L _| P tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)R OTO _563S | I M P L Es]t/eNpCSCiLz_eS(TnEcPcSl/Sshimzeemo.fc(oTm)m). b{u f f| S ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i z e| s group(group[ NCCL_PROTO_SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h]:/687N:C11C:L _note: Sin instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT EPS/si z687e | o f ( T ) ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p r i| m group(groups (tid-tidStar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:B666c:a9s:t ,note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren Thre a666d | s B c a s t , &pdriirmesc(tt-i>do,u tn,T hnruelaldpstGra,t haerrg,s -d>isreencdtb-u>fufp,, aNrUgLsL-,> raercgvsb-u>fsfe,n d b| u ^f f, args->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e202c:v53b:u fnote: fin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : Rnote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren Wor k202E | l e m e n t < F nR,u nTW,o rRkeEdlOepm,e nAtld(O)p.,r uAnl(gwoe,) ;P r o| t ^o 
>().run(we/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp):;10 : 1| : ^ note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp :1011 | :I1M:P Lnote: _in instantiation of member function 'RunWork, 2, 2>::run' requested hereC OLL _11F | UINMCP(LA_lClORLeLd_uFcUeN,C (CAOlLlLRNeEdTu_cDeI,R ECCOTL,L NSEITM_PDLIER,E CMTa,x ,S IhMaPlLfE), M| a^x , fl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:a391t:)95 : | note: ^expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391 :39195 | : note: Rexpanded from macro 'IMPL_COLL_FUNC'u nWork <391n | c c lRFuunnWco#r#kfv,r eNdCoCpL<_tAyLpGeO>_,# #NaClCgLo_,A LNGCOC_L#_#PaRlOgToO,_ #N#CpCrLo_tPoR>O(T)O._r#u#np(r&ontcoc>l(S)h.mreumn.(w&onrckc)l;S h\m e m| . 
^w ork); \/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^15 : note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15 :562 | note: field 'nthreads' will be initialized after field 'tidInBlock' ti d562( | t i d ) ,t indt(htrieda)d,s (nntthhrreeaaddss()n,t htriedaIdnsB)l,o ctki(dtIhnrBelaodcIkd(xt.hxr)e,a dgIrdoxu.px()g,r ogurpo)u,p ( g| r ^~~~~~~~~~~~~~~~~o up),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^~~~~~~~~~~~~~~~~60 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :field 'group' will be initialized after field 'stepSize'562 :60: note: field 'group' will be initialized after field 'stepSize'562 | 562t | i d ( t itdi)d,( tnitdh)r,e andtsh(rnetahdrse(andtsh)r,e atdisd)I,n BtliodcIkn(Btlhorceka(dtIhdrxe.axd)I,d xg.rxo)u,p (ggrroouupp()g,r o u| p ^~~~~~~~~~~) , | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member 
function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALG/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562#:#15a:l gwarning: oinitializer order does not match the declaration order [-Wreorder-ctor], NCCL_P R562O | T O _ # #tpirdo(ttoi>d()),. 
rnutnh(r&enacdcsl(Snhtmherme.awdosr)k,) ;t i\dI | n ^B lock(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15I:d xnote: .field 'nthreads' will be initialized after field 'tidInBlock'x ), gr o562u | p ( g r otuipd)(,t i d| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, n| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h read s563( | n t h r esatdesp)S,i ztei(dnIcncBllSohcmke(mt.hcroemamd.Ibduxf.fxS)i,z egsr[oNuCpC(Lg_rPoRuOpT)O,_ S I| M ^~~~~~~~~~~~~~~~~P LE]//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hN:C562C:L60_:S Tnote: Efield 'group' will be initialized after field 'stepSize'P S/siz e562o | f ( T ) )t i{d ( t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d ) ,| group(groupn thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:(677n:t11h:r enote: ain instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered s), t i677d | I n B l o c k ( t h rperaidmIsd(xt.ixd)-,t igdrSotuapr(tgBrcoauspt),, n T| h ^~~~~~~~~~~r eadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork,r eNaCdCsL(_nAtLhGrOe_a#d#sa)l,g 
ot,i dNICnCBLl_oPcRkO(TtOh_r#e#apdrIodtxo.>x()),. rgurno(u&pn(cgcrloSuhpm)e,m . w| o ^~~~~~~~~~~~~~~~~r k); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h60::562 :note: 15field 'group' will be initialized after field 'stepSize': note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d Iwarning: dinitializer order does not match the declaration order [-Wreorder-ctor]x .x), gro u562p | ( g r o utpi)d,( t i| d ^~~~~~~~~~~~~~~~~) , nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:s (note: nfield 'group' will be initialized after field 'stepSize't hread s562) | , t i dtIindB(ltoicdk)(,t hnrtehardeIaddxs.(xn)t,h rgeraodusp)(,g rtoiudpI)n,B l o| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~k ( t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eadI d563x | . 
x ) , sgtreopuSpi(zger(onucpc)l,S h m| e ^~~~~~~~~~~m .comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h note: field 'nthreads' will be initialized after field 'tidInBlock' :562:15 :562 | warning: initializer order does not match the declaration order [-Wreorder-ctor] tid(tid), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l ogcrko(utph(rgeraoduIpd)x,. 
x )| , ^~~~~~~~~~~~~~~~~ gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:(60g:r onote: ufield 'group' will be initialized after field 'stepSize'p ), | 562 ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tid(t i563d | ) , n tshtreepaSdisz(en(tnhcrcelaSdhsm)e,m .tciodmImn.BbluofcfkS(itzherse[aNdCICdLx_.PxR)O,T Og_rSoIuMpP(LgEr]o/uNpC)C,L _ S| T ^~~~~~~~~~~E PS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 
| RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nt, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement<Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 
7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), groSTEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation 
of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hSIMPL:E562]:/15N:C Cwarning: Linitializer order does not match the declaration order [-Wreorder-ctor]_ STEPS/sizeof(T)) {562 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t| i group(groupd (tid), n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h687r:e11a:d snote: (in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren thr e687a | d s ) , t i d I n Bplroicmks((tthirde-atdiIddSxt.axr)t,B cgarsotu,p (ngTrhoruepa)d,s B c| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s t ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)& dire c563t | - > o u ts,t enpuSlilzpet(rn,c calrSghsm-e>ms.ecnodmbmu.fbfu,f faSrigzse-s>[rNeCcCvLb_uPfRfO,T O _| S ^I MPLE]//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C202C:L53_:S Tnote: Ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereP S/s i202z | e o f ( T ) ) {R u n| W ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o r k| E group(groupl ement, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo , Proto >641( | ) . 
r u n ( w e ) ; p r| i ^m s(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp-:t12i:d1S:t anote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heret Red u12c | eI,M PnLT_hCrOeLaLd_sFRUeNdCu(cAel,l Rdeidrueccet,- >CdOoLwLnN,E T&_dDiIrReEcCtT-,> oSuItM,P LaEr,g sM-a>xs,e ndouble) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ dbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:L562G:O15_:# #warning: ainitializer order does not match the declaration order [-Wreorder-ctor]l go, NCCL_ P562R | 
O T O _ #t#ipdr(ottiod>)(,) .nrtuhnr(e&andcsc(lnSthhmreema.dwso)r,k )t;i d\I n B| l ^o ck(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562d:x15.:x )note: ,field 'nthreads' will be initialized after field 'tidInBlock' group (562g | r o u p )t,i d (| t ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) nthr e563a | d s ( n tshtreepaSdisz)e,( ntcicdlISnhBmleomc.kc(otmhmr.ebaudfIfdSxi.zxe)s,[ NgCrCoLu_pP(RgOrToOu_pS)I,M P L| E ^~~~~~~~~~~~~~~~~] /NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562S:T60E:P Snote: /field 'group' will be initialized after field 'stepSize's izeo f562( | T ) ) {t i d| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| ) group(group, nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hn:t626h:r9e:a dnote: sin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , tidI n626B | l o c k ( t h r epardiImdsx(.txi)d,- tgirdoup(gSrtoaurpt)S,c a t| t ^~~~~~~~~~~e r, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, 
rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, 
FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, 
rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ uff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 
'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h tid(:t562i:d15):, warning: ninitializer order does not match the declaration order [-Wreorder-ctor]t hreads(nthread s562) | , t i d ItniBdl(otcikd()t,h rnetahdrIedaxd.sx()n,t hgrreoaudps()g,r otuipd)I,n B l| o ^~~~~~~~~~~c k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:i562z:e15(:n cwarning: cinitializer order does not match the declaration order [-Wreorder-ctor]l Shmem.comm. b562u | f f S i zteisd[(NtCiCdL)_,P RnOtThOr_eSaIdMsP(LnEt]h/rNeCaCdLs_)S,T EtPiSd/IsniBzleoocfk((Tt)h)r e{a d I| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~x . x| ) group(group, group(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hg:r677o:u11p:) ,note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)677 | 563 | psrtiempsS(itzied(-ntcicdlSSthamretmB.ccaosmtm,. 
bnuTfhfrSeiazdessB[cNaCsCtL,_ P&RdOiTrOe_cStI-M>PoLuEt],/ NdCiCrLe_cSt->dToEwPnS,/ sairzgeso-f>(sTe)n)d b{u f f| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ a r| g group(groups ->recvbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 641| : ^11 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :64153 | : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | p r i m s ( t i dR-utniWdoSrtkaErlteRmeednutc>(d)o.wrnu,n (&wdei)r;e c t| - ^> out, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppg:s11-:>1s:e nnote: din instantiation of member function 'RunWork, 2, 2>::run' requested hereb uff ,11 | aIrMgPsL-_>CrOeLcLv_bFuUfNfC,( A l| l ^R educe,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :C202O:L53L:N Enote: Tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here_ DIRE C202T | , S I M P L E ,R uMnaWxo,r kfElloeamte)n t <| F^n , T,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :R391e:d95O:p ,note: expanded from macro 'IMPL_COLL_FUNC'A lgo, P391r | o t oR>u(n)W.orrukn<(nwcec)l;F u n| c ^# #func, t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppy:p12e:,1 :F unote: nin instantiation of member function 'RunWork, 2, 2>::run' requested herec ##de v12r | eIdMoPpL<_tCyOpLeL>_,F UNNCCC(LA_lAlLRGeOd_u#c#ea,l gCoO,L LNNCECTL__DPIRROETCOT_,# #SpIrMoPtLoE>,( )M.arxu,n (d&onucbclleS)h m e| m^. 
work/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):;391 :\95 : | note: ^expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562391: | 15 : Rnote: ufield 'nthreads' will be initialized after field 'tidInBlock'n Work< n562c | c l F u ntci#d#(ftuindc),, tnytpher,e aFdusn(cn#t#hdreevardesd)o,p B,l oNcCkC(Lt_hArLeGaOd_I#d#xa.lxg)o,, gNrCoCuLp_(PgRrOoTuOp_)#,# p r| o ^~~~~~~~~~~~~~~~~t o>()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:r562u:n60(:& nnote: cfield 'group' will be initialized after field 'stepSize'c lShme m562. | w o r k )t;i d\( t i| d ^) , nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15s:( nnote: tfield 'nthreads' will be initialized after field 'tidInBlock'h reads )562, | t i d ItniBdl(otcikd()t,h rnetahdrIedaxd.sx()n,t hgrreoaudps()g,r otuipd)I,n B l| o ^~~~~~~~~~~c k(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ CL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/:s562i:z15e:o fwarning: (initializer order does not match the declaration order [-Wreorder-ctor]T )) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group562 | tid(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :n687t:h11r:e anote: din instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres (nthr e687a | d s ) , t i d I n Bplroicmks((tthirde-atdiIddSxt.axr)t,B cgarsotu,p (ngTrhoruepa)d,s B c| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s t ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)& dire c563t | - > o u ts,t enpuSlilzpet(rn,c calrSghsm-e>ms.ecnodmbmu.fbfu,f faSrigzse-s>[rNeCcCvLb_uPfRfO,T O _| S ^I MPLE]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:S202T:E53P:S /note: sin instantiation of member function 
'RunWorkElement, 2, 2>::run' requested herei zeof (202T | ) ) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ R u| n group(groupW orkElement, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree dOp, A687l | g o , P r o t o > (p)r.irmusn((twied)-;t i d| S ^t artBca/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpps:t12,: 1n:T hnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree ads B12c | aIsMtP,L _&CdOiLrLe_cFtU-N>Co(uAtl,l Rneudlulcpter,, CaOrLgLsN-E>Ts_eDnIdRbEuCfTf,, SaIrMgPsL-E>,r eMcavxb,u fdfo,u b l| e ^) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::53391:: 95note: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here note: expanded from macro 'IMPL_COLL_FUNC' 202 | 391 | RRuunnWWoorrkkd(o)p.e,) ;N C C| L ^_ ALGO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cppa:l11g:o1,: Nnote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested hereC L_PR O11T | OI_M#P#Lp_rCoOtLoL>_(F)U.NrCu(nA(l&lnRcecdluSchem,e mC.OwLoLrNkE)T;_ D\I R E| C ^T , SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:E562,: 15M:a xnote: ,field 'nthreads' will be initialized after field 'tidInBlock' float) 562 | | ^ ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:(391t:i95d:) ,note: expanded from macro 'IMPL_COLL_FUNC'n thread s391( | n t hRruenaWdosr)k,< ntcicdlIFnuBnlco#c#kf(utnhcr,e atdyIpdex,. 
xF)u,n cg#r#oduepv(rgerdooupp<)t,y p e| > ^~~~~~~~~~~~~~~~~, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562A:L60G:O _note: #field 'group' will be initialized after field 'stepSize'# algo, 562N | C C L _ PtRiOdT(Ot_i#d#)p,r onttoh>r(e)a.drsu(nn(t&hnrcecaldSsh)m,e mt.iwdoIrnkB)l;o c\k ( t| h ^r eadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)15,: gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up(gr o562u | p ) , t| i ^~~~~~~~~~~d (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, 
direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, 
rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ E, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ #devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested 
here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 
'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of 
member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(Al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&nccSlShmTeEmP.Sw/osrikz)e;o f\( T )| ) ^ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h group(group: 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 641 :t11i:d (note: tin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei d), nt h641r | e a d s ( n t h r e apdrsi)m,s (ttiiddI-ntBildoSctka(rtthRreedaudcIed,x .nxT)h,r egardosuRpe(dgurcoeu,p )d,i r e| c ^~~~~~~~~~~~~~~~~t ->do/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hw:n562,: 60&:d inote: rfield 'group' will be initialized after field 'stepSize'e ct->o 
u562t | , a r gtsi-d>(steindd)b,u fnft,h raeragdss-(>nrtehcrvebaudfsf),, t| i ^d InBlock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:I dnote: xin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here. x), g r202o | u p ( g r o u p )R,u n W| o ^~~~~~~~~~~r kElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx940. 67 warnings generated when compiling for gfx90a. 67 warnings generated when compiling for gfx941. 67 warnings generated when compiling for gfx908. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, 
NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 76%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' 
set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' 
requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 
0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorki,d )N,C CnLt_hArLeGaOd_s#(#natlhgroe,a dNsC)C,L _tPiRdOITnOB_l#o#cpkr(otthor>e(a)d.Irduxn.(x&)n,c cglrSohumpe(mg.rwoourpk)),; \| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ^| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :563562 | : 15 : note: sfield 'nthreads' 
will be initialized after field 'tidInBlock't epSiz e562( | n c c l Sthimde(mt.icdo)m,m .nbtuhfrfeSaidzse(sn[tNhCrCeLa_dPsR)O,T Ot_iSdIIMnPBLlEo]c/kN(CtChLr_eSaTdEIPdSx/.sxi)z,e ogfr(oTu)p)( g{r o u| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) , | group(group| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h60::916 :note: 7field 'group' will be initialized after field 'stepSize': note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 916 | t i d ( t i dp)r,i mnst(hgrreoaudpsTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | 
prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 
| RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 
| RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' 
requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation
of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 
args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | 
prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 
in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] d 562 | e x ,t iadr(gtsi-d>)c,o nnntIhnrdeeaxd)s;( n t| h ^r eads), tidInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hc:k80(:t5h:r enote: ain instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested hered Idx. x80) | , g r oruupn(Rgirnogu563( | a r g s )s;t e p| S ^i ze(nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:h202m:e53m:. cnote: oin instantiation of member function 'RunWorkElement, 1, 2>::run' requested herem m.bu f202f | S i z e s [ N C CRLu_nPWRoOrTkOE_lSeImMePnLtE<]F/nN,C CTL,_ SRTeEdPOSp/,s iAzlegoof,( TP)r)o t{o > (| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~. 
r u| n group(group( we); | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :34:7:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp :note: 7in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 34 | 7 | I M PpLr_iCmOsL(Lt_iFdU,N Cn(tRherdeuacdes,, R&IrNiGn,g -S>IpMrPeLvE,, &Mraixn,g -u>innetx3t2,_ ta)r g s| -^> sendb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:f391f:,95 :a rnote: gexpanded from macro 'IMPL_COLL_FUNC's ->recv b391u | f f ,R uanrWgosr-k>pceo,n nFIunndce#x#,d eavrrgesd-o>pcd,e xN)C;C L _| A ^L GO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h#:a80l:g5o:, note: Nin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested hereC CL_ P80R | O T O _ #r#upnrRoitnog><(T),. 
rRuend(O&pn,c cPlrSohtmoe>m(.awrogrsk));; \| ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here15 : note: field 'nthreads' will be initialized after field 'tidInBlock'202 | 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Shmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, 
FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of 
member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 
562warning: :initializer order does not match the declaration order [-Wreorder-ctor]60 : note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ^~~~~~~~~~~| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | step/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Size(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h; :| 562 ^: 15: warning: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cppinitializer order does not match the declaration order [-Wreorder-ctor]: 13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 562 | 13 | I M PtLi_dC(OtLiLd_)F,U NnCt(hRreedaudcse(,n tRhIrNeGa,d sS)I,M PtLiEd,I nMBalxo,c kr(ctchlr_ebafdlIodaxt.1x6)), g| r^o up(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u391p:)95,: note: | expanded from macro 'IMPL_COLL_FUNC' ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 391 | R563u | n W o r ksO,_ SNICMCPLL_EA]L/GNOC_C#L#_aSlTgEoP,S /NsCiCzLe_oPfR(OTT)O)_ #{# p r| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t o >| ( group(group) 
.run(&nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hc:l34S:h7m:e mnote: .in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herew ork); \ 34 | | ^ pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:m562s:(15t:i dnote: ,field 'nthreads' will be initialized after field 'tidInBlock' nthre a562d | s , & rtiindg(-t>ipdr)e,v ,n t&hrrienagd-s>(nnetxhtr,e aadrsg)s,- >tsiednIdnbBulfofc,k (atrhgrse-a>drIedcxv.bxu)f,f ,g raorugps(-g>rroeudpO)p,A r g| , ^~~~~~~~~~~~~~~~~ 0, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:r562g:s60-:> cnote: ofield 'group' will be initialized after field 'stepSize'n nIn d562e | x , a rtgisd-(>tciodn)n,I nndtehxr)e;a d s| ( ^n threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hd:s80):,5 :t inote: din instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested hereI nBl o80c | k ( t h rreuandRIidnxg.,( a r| g ^~~~~~~~~~~s ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested 
here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: 
in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' 
will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' 
requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1030. 
17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated 
when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o -MF 
CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: 
unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, 
SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, 
SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:d562x:.15x:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor]g roup(group) ,562 | | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ t| i tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d (tid), n t563h | r e a d ss(tnetphSriezaed(sn)c,c ltSihdmIenmB.lcoocmkm(.tbhurfefaSdiIzdexs.[xN)C,C Lg_rPoRuOpT(Og_rSoIuMpP)L,E ] /| N ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C C L| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S TEPS /563s | i z e o fs(tTe)p)S i{z e (| n ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~c c l| S group(grouph mem.comm.buffSizes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hN:C34C:L7_:P Rnote: Oin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT O_SIMP L34E | ] / N C C L _pSrTiEmPsS(/tsiidz,e onft(hTr)e)a d{s , | & ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r i n| g group(group- >prev, &ring->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hn:e34x:t7,: anote: rin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereg s->sen 
d34b | u f f , a rpgrsi-m>sr(etcivdb,u fnft,h raeragdss-,> r&erdiOnpgA-r>gp,r e0v,, a&rrgisn-g>-c>onnenxItn,d eaxr,g sa-r>gsse-n>dcbounfnfI,n daerxg)s;- > r| e ^c vbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h,: 80a:r5g:s -note: >in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested herer edO p80A | r g , 0r,u naRrignsg-<>Tc,o nRneIdnOdpe,x ,P raortgos>-(>acrognsn)I;n d e| x ^) ; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h::5380:: 5note: :in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 202 | 80 | r uRnuRniWnogrR(eadrOgps,) ;A l g| o ^, Proto>()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:r202u:n53(:w enote: )in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here; | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp : 7 : 1R:u nnote: Win instantiation of member function 'RunWork, 1, 2>::run' requested hereo rkE l7e | mIeMnPtL<_FCnO,L LT_,F URNeCd(ORpe,d uAcleg,o ,R IPNrGo,t oS>I(M)P.LrEu,n (Swuem)P;o s t| D ^i v, uint/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp3:26_:t1): note: | in instantiation of member function 'RunWork, 1, 2>::run' requested here^ 6 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:M391P:L95_:C Onote: Lexpanded from macro 'IMPL_COLL_FUNC'L _FUNC(R e391d | u c eR,u nRWIoNrGk, Snote: ,expanded from macro 'IMPL_COLL_FUNC' NCCL_A L391G | O _ #R#uanlWgoor,k p(e),. 
rFuunn(c&#n#cdcelvSrhemdeomp.;, \N C C| L ^_ ALGO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:a562l:g15o:, note: Nfield 'nthreads' will be initialized after field 'tidInBlock'C CL_PR O562T | O _ # # ptriodt(ot>i(d)).,r unnt(h&rnecacdlsS(hnmtehmr.ewaodrsk)),; t\i d I| n ^B lock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d Inote: dfield 'nthreads' will be initialized after field 'tidInBlock'x .x), g r562o | u p ( g rtoiudp()t,i d )| , ^~~~~~~~~~~~~~~~~ nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:a562d:s60(:n tnote: hfield 'group' will be initialized after field 'stepSize'r eads) ,562 | t i d I ntBildo(ctki(dt)h,r enatdhIrdexa.dxs)(,n tghrroeuapd(sg)r,o utpi)d,I n B| l ^~~~~~~~~~~~~~~~~o ck(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:I dnote: xfield 'group' will be initialized after field 'stepSize'. 
x), gro u562p | ( g r o utpi)d,( t i| d ^~~~~~~~~~~) , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.coIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/si/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx900. 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx803. 13 warnings generated when compiling for gfx1101. 13 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 71%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: 
unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' 
set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 626 :| 9 ^: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202: 53626: | note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | p r i m s ( t i dR-utniWdoSrtkaErlteSmceanttt (d)i.rreucnt(-w>eu)p;, a| r ^g 
s->s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppe:n4d:b1u:f fnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here arg s4- | >IrMePcLv_bCuOfLfL,_ F U| N ^C (AllReduce, COLL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:E202T:_53D:I Rnote: Ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereC T, SIM P202L | E , M i n , iRnutn8W_otr)k E l| e^m ent/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h<:F391n:,95 :T ,note: expanded from macro 'IMPL_COLL_FUNC'R edOp, 391A | l g oR,u nPWroortko<>n(c)c.lrFuunn(cw#e#)f;u n c| , ^ type, Func#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp#:d4e:v1r:e dnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested herep 4, | INMCPCLL__CAOLLGLO__F#U#NaCl(gAol,l RNeCdCuLc_eP,R OCTOOL_L#N#EpTr_oDtIoR>E(C)T.,r uSnI(M&PnLcEc,l SMhimne,m .iwnotr8k_)t;) \ | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::391562::9515:: note: note: expanded from macro 'IMPL_COLL_FUNC'field 'nthreads' will be initialized after field 'tidInBlock' 562 | 391 | RtuindW(otrikd<)n,c cnltFhurneca#d#sf(unntch,r etaydpse),, FtuindcI#n#Bdleovcrke(tdhorpe.,x )N,C CgLr_oAuLpG(Og_r#o#uapl)g,o , | N ^~~~~~~~~~~~~~~~~C CL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:P562R:O60T:O _note: #field 'group' will be initialized after field 'stepSize'# prot o562> | ( ) . 
r utni(d&(ntcicdl)S,h mnetmh.rweoardks)(;n t\h r e| a ^d s), tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:o562c:k15(:t hnote: rfield 'nthreads' will be initialized after field 'tidInBlock'e adIdx.x )562, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~r eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562P:R15O:T Owarning: initializer order does not match the declaration order [-Wreorder-ctor] _##proto>( )562. | r u n ( &tnicdc(ltSihdm)e,m .nwtohrrke)a;d s\( n t| h ^r eads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562I:n15B:l onote: cfield 'nthreads' will be initialized after field 'tidInBlock'k (threa d562I | d x . x )t,i dg(rtoiudp)(,g rnotuhpr)e,a d s| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n t h| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ads), 563t | i d I n BsltoecpkS(itzher(enacdcIldSxh.mxe)m,. 
cgormomu.pb(ugfrfoSuipz)e,s [ N| C ^~~~~~~~~~~~~~~~~C L_PR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:T562O:_60S:I Mnote: Pfield 'group' will be initialized after field 'stepSize'L E]/NC C562L | _ S T E PtSi/ds(itziedo)f,( Tn)t)h r{e a d| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( n t| h group(groupr eads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hB:l641o:c11k:( tnote: hin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eadIdx .641x | ) , g r o u p ( g rporuipm)s,( t i| d ^~~~~~~~~~~- tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 
0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ o>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 
7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x)>recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in 
instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 
| RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args-id(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | 
prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of 
member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ irect->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Mi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:, 562i:n15t:8 _warning: tinitializer order does not match the declaration order [-Wreorder-ctor]) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391 :56295 | : note: expanded from macro 'IMPL_COLL_FUNC' tid(ti d391) | , nRtuhnrWeoardks<(nnctchlrFeuandcs#)#,f utnicd,I ntBylpoec,k (Ftuhnrce#a#ddIedvxr.exd)o,p (,g rNoCuCpL)_,A L G| O ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ # #| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l go, NC C563L | _ P R O TsOt_e#p#Spirzoet(on>c(c)l.Srhumne(m&.nccocmlmS.hbmuefmf.Swiozreks)[;N C\C L _| P ^R OTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:I562M:P15L:E ]note: /field 'nthreads' will be initialized after field 'tidInBlock'N CCL_ S562T | E P S / stiizde(otfi(dT)),) n{t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d s| ( group(groupn threads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hB:l687o:c11k:( tnote: hin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer eadIdx. 
x687) | , g r o u p ( g r opurpi)m,s ( t| i ^~~~~~~~~~~~~~~~~d -ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:S562t:a60r:t Bnote: cfield 'group' will be initialized after field 'stepSize'a st, 562n | T h r e atdisdB(ctaisdt),, &ndtihrreecatd-s>(onutth,r enaudlsl)p,t rt,i daIrngBsl-o>cske(ntdhbruefafd,I daxr.gxs)-,> rgercovubpu(fgfr,o u p| ) ^, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement562(:)15.:r uwarning: ninitializer order does not match the declaration order [-Wreorder-ctor]( we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp562: | 7 : 1 : tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested hered (tid )7, | InMtPhLr_eCaOdLsL(_nFtUhNrCe(aAdlsl)R,e dtuicdeI,n BClOoLcLkN(EtTh_rDeIaRdEICdTx,. 
xS)I,M PgLrEo,u pM(ignr,o uupi)n,t 3 2| _ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t ) | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h563: | 391 : 95 : snote: texpanded from macro 'IMPL_COLL_FUNC'e pSize (391n | c c lRSuhnmWeomr.kcS,/ sNiCzCeLo_fA(LTG)O)_ #{# a l| g ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o , | N group(groupC CL_PROTO_##proto>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:)687.:r11u:n (note: &in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren cclShm e687m | . w o r k ) ; \ p| r ^i ms(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:-15t:i dnote: Sfield 'nthreads' will be initialized after field 'tidInBlock't artB c562a | s t , ntTihdr(etaidds)B,c anstth,r e&addisr(enctth-r>eoaudts,) ,n utlildpItnrB,l oacrkg(st-h>rseeanddIbduxf.fx,) ,a rggrso-u>pr(egcrvobuupf)f,, | | ^~~~~~~~~~~~~~~~~ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 202field 'group' will be initialized after field 'stepSize': 53: note: 562in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | t i202d | ( t i d ) , n tRhurneWaodrsk(Enltehmreenatd)(,) .grruonu(pw(eg)r;o u p| ) ^, | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 562 :| 15 group(group: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h: 641562: | 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret id(ti d641) | , n t h r e a d s (pnrtihmrse(atdisd)-,t itdiSdtIanrBtlRoecdku(cteh,r enaTdhIrdexa.dxs)R,e dgurcoeu,p (dgirroeucpt)-,> d o| w ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~n , | & tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)d irect -563> | o u t , satregpsS-i>zsee(nndcbculfSfh,m eamr.gcso-m>mr.ebcuvfbfuSfifz,e s [| N ^C CL_PRO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hT:O202_:S53I:M Pnote: Lin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereE ]/NC C202L | _ S T E P S / s iRzuenoWfo(rTk)E)l e{m e n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~< F n| , group(group T, RedOp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 666A:l9g:o ,note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereP roto> (666) | . 
r u n ( w e ) ;p r i| m ^s (tid, n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppT:h5r:e1a:d snote: Gin instantiation of member function 'RunWork, 2, 2>::run' requested herea ther ,5 | dIiMrPeLc_tC-O>LuLp_,F UNNUCL(LA,l laRregdsu-c>es,e nCdObLuLfNfE,T _aDrIgRsE-C>Tr,e cSvIbMuPfLfE,, M| i ^n , uint8/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:t202): 53 :| ^note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 391202: | 95 : note: expanded from macro 'IMPL_COLL_FUNC' Ru n391W | o r kREulneWmoernkt<#(#)d.ervurne(dwoep)<;t y p| e ^> , NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppA:L7G:O1_:# #note: ain instantiation of member function 'RunWork, 2, 2>::run' requested herel go, 7N | CICMLP_LP_RCOOTLOL__#F#UpNrCo(tAol>l(R)e.druucne(,& nCcOcLlLSNhEmTe_mD.IwRoErCkT),; S\I M P| L ^E , Mi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:,562 :u15i:n tnote: 3field 'nthreads' will be initialized after field 'tidInBlock'2 _t) | 562^ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i391d:(95t:i dnote: )expanded from macro 'IMPL_COLL_FUNC', nthr e391a | d s (RnutnhWroerakd , | N ^~~~~~~~~~~~~~~~~C CL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hA:L562G:O60_:# #note: afield 'group' will be initialized after field 'stepSize'l go, N C562C | L _ P R OtTiOd_(#t#ipdr)o,t on>t(h)r.eraudns((&nntchcrleSahdmse)m,. 
wtoirdkI)n;B l\o c k| ( ^t hrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:I562d:x15.:x )note: ,field 'nthreads' will be initialized after field 'tidInBlock' group (562g | r o u p )t,i d (| t ^~~~~~~~~~~i d), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562):;15 :\ warning: initializer order does not match the declaration order [-Wreorder-ctor]| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 15562: | note: field 'nthreads' will be initialized after field 'tidInBlock' tid( t562i | d ) , nttihdr(etaidds)(,n tnhtrheraedasd)s,( nttihdrIenaBdlso)c,k (ttihdrIenaBdlIodcxk.(xt)h,r egardoIudpx(.gxr)o,u pg)r,o u p| ( ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g r o| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p ), | ^~~~~~~~~~~~~~~~~563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :s562t:e60p:S inote: zfield 'group' will be initialized after field 'stepSize'e (nccl S562h | m e m . 
ctoimdm(.tbiudf)f,S inztehsr[eNaCdCsL(_nPtRhOrTeOa_dSsI)M,P LtEi]d/INnCBClLo_cSkT(EtPhSr/esaidzIedoxf.(xT)),) g{r o u| p ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( g r| o group(groupu p), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: 
note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15p:r iwarning: minitializer order does not match 
the declaration order [-Wreorder-ctor]s (tid-tidS t562a | r t B c atsitd,( tniTdh)r,e andtshBrceaasdts,( n&tdhirreeacdts-)>,o utti,d IdniBrleocctk-(>tdhorwena,d Iadrxg.sx-)>,s egnrdobuupf(fg,r oaurpg)s,- > r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c v b| u tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)f f, | ^563 | s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:e202p:S53i:z enote: (in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren ccl S202h | m e m . c o m m .RbuunfWfoSrikzEelse[mNeCnCtL<_FPnR,O TTO,_ SRIeMdPOLpE,] /ANlCgCoL,_ SPTrEoPtSo/>s(i)z.eroufn((Tw)e)) ;{ | | ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655 :511 | :I Mnote: Pin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL _COLL_F U655N | C ( A l l R e d u c ep,r iCmOsL(LtNiEdT-_tDiIdRSEtCaTr,t RSeIdMuPcLeE,, nMTihnr,e audisnRte8d_utc)e , | n^u llpt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:,391 :&95d:i rnote: eexpanded from macro 'IMPL_COLL_FUNC'c t->ou t391, | a rRgusn-W>osreknnrce,c vtbyupfef,, F u| n ^c ##devredop/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h<:t202y:p53e:> ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereN CCL_A L202G | O _ # # a l g o ,R uNnCWCoLr_kPERlOeTmOe_n#t#,( )R.erduOnp(,& nAclcgloS,h mPermo.twoo>r(k)).;r u\n ( | w ^e ); | 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^: 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6 :5621 | : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here tid(t i6d | )I,M PnLt_hCrOeLaLd_sF(UnNtCh(rAelaldRse)d,u ctei,d ICnOBLlLoNcEkT(_tDhIrReEaCdTI,d xS.IxM)P,L Eg,r oMuipn(,g rionutp3)2,_ t )| ^~~~~~~~~~~~~~~~~ | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 391field 'group' will be initialized after field 'stepSize': 95: note: 562expanded from macro 'IMPL_COLL_FUNC' | tid (391t | i d )R,u nnWtohrrkeo,u pN(CgCrLo_uApL)G,O _ #| # ^~~~~~~~~~~a lgo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h3:2562_:t15): warning: | initializer order does not match the declaration order [-Wreorder-ctor]^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 391562: | 95 : note: expanded from macro 'IMPL_COLL_FUNC't id(tid), 391n | t h 
rReuandWso(rnkto,u pN)C,C L _| A ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L G O| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# #alg o563, | N C C Ls_tPeRpOSTiOz_e#(#npcrcoltSoh>m(e)m..rcuonm(m&.nbcucflfSShimzeems.[wNoCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rk); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 
'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##p, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop<type>, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 
'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/NCCL:_562S:T15E:P Swarning: /initializer order does not match the declaration order [-Wreorder-ctor]s izeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 562| | group(group tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:)655,: 11n:t hnote: rin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ads(n t655h | r e a d s ) , t i dpIrniBmlso(ctki(dt-htriedaSdtIadrxt.Rxe)d,u cger,o unpT(hgrreoaudps)R,e d u| c ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e , | n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u llptr ,563 | & d i r esctte-p>Soiuzte,( nacrcglsS-h>mseemn.dcboumfmf.,b uafrfgSsi-z>erse[cNvCbCuLf_fP,R O T| O ^_ SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h]:/202N:C53C:L _note: Sin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereT EPS/ s202i | z e o f ( T ) ) R{u n W| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r k E| l group(groupe ment, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer oto>() .687r | u n ( w e ) ; | ^p 
rims(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppi:d6-:t1i:d Snote: tin instantiation of member function 'RunWork, 2, 2>::run' requested herea rtBc a6s | tI,M PnLT_hCrOeLaLd_sFBUcNaCs(tA,l l&Rdeidrueccet,- >CoOuLtL,N EnTu_lDlIpRtErC,T ,a rSgIsM-P>LsEe,n dMbiunf,f ,i natr3g2s_-t>)r e c| v^b uff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391| : ^95 : note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202 :39153 | : note: Rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu nWork <202n | c c l F u n c # #RfuunnWco,r ktEylpeem,e nFtuo,, NPCrCoLt_oA>L(G)O._r#u#na(lwgeo),; N C| C ^L _PROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp_:#7#:p1r:o tnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested here> ().r u7n | (I&MnPcLc_lCSOhLmLe_mF.UwNoCr(kA)l;l R\e d u| c ^e , CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:L562N:E15T:_ Dnote: Ifield 'nthreads' will be initialized after field 'tidInBlock'R ECT, S562I | M P L E ,t iMdi(nt,i du)i,n tn3t2h_rte)a d s| (^n threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s391):,95 :t inote: dexpanded from macro 'IMPL_COLL_FUNC'I nBlock (391t | h r eRaudnIWdoxr.kx<)n,c cglrFouunpc(#g#rfouunpc),, t y| p ^~~~~~~~~~~~~~~~~e , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hF:u562n:c60#:# dnote: efield 'group' will be initialized after field 'stepSize'v redo p562< | t y p e >t,i dN(CtCiLd_)A,L GnOt_h#r#eaaldgso(,n tNhCrCeLa_dPsR)O,T Ot_i#d#IpnrBoltooc>k(()t.hrruena(d&Indcxc.lxS)h,m egmr.owuopr(kg)r;o u\p ) ,| ^ | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ direct->up, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562A:L15G:O _warning: #initializer order does not match the declaration order [-Wreorder-ctor]# algo, NC C562L | _ P R O TtOi_d#(#tpirdo)t,o >n(t)h.rreuand(s&(nnctchlrSehamdesm).,w otrikd)I;n B\l o c| k ^( threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:x562.:x15):, note: gfield 'nthreads' will be initialized after field 'tidInBlock'r oup(g r562o | u p ) , t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , nth r563e | a d s ( nsttherpeSaidzse)(,n ctcildSIhnmBelmo.ccko(mtmh.rbeuafdfISdixz.exs)[,N CgCrLo_uPpR(OgTrOo_uSpI)M,P L E| ] ^~~~~~~~~~~~~~~~~/ 
NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562S:T60E:P Snote: /field 'group' will be initialized after field 'stepSize's izeof (562T | ) ) { t i| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( t i| d group(group) , nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:s666(:n9t:h rnote: ein instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herea ds), t666i | d I n B l o c k (ptrhirmesa(dtIiddx,. xn)T,h rgeraoduspG(agtrhoeurp,) ,d i r| e ^~~~~~~~~~~c t->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdxnthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ .x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::391562::9515:: note: warning: expanded from macro 'IMPL_COLL_FUNC'initializer order does not match the declaration order [-Wreorder-ctor] 391 | R u562n | W o r k h,r eNaCdCILd_xA.LxG)O,_ #g#raolugpo(,g rNoCuCpL)_,P R O| T ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~O _ #| # tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)p roto>( )563. | r u n ( &sntcecplSSihzmee(mn.cwcolrSkh)m;e m\. 
c o| m ^m .bu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:f562S:i15z:e snote: [field 'nthreads' will be initialized after field 'tidInBlock'N CCL_ P562R | O T O _ StIiMdP(LtEi]d/)N,C CnLt_hSrTeEaPdSs/(snitzheroefa(dTs))), {t i d| I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n B l| o group(groupc k(threadIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hx:)677,: 11g:r onote: uin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herep (group) ,677 | | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60 :p rnote: ifield 'group' will be initialized after field 'stepSize'm s(ti d562- | t i d S ttairdt(Btciads)t,, nntThhrreeaaddss(Bnctahsrte,a d&sd)i,r etcitd-I>noBulto,c kd(itrherceta-d>Iddoxw.nx,) ,a rggrso-u>ps(egnrdobuupf)f,, a| r ^~~~~~~~~~~g s->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:d562s:)15,: twarning: iinitializer order does not match the declaration order [-Wreorder-ctor]d InBlock (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~d s), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:I60n:B lnote: ofield 'group' will be initialized after field 'stepSize'c k(thre a562d | I d x . xt)i,d (gtriodu)p,( gnrtohurpe)a,d s (| n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t h r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a ds), 563t | i d I n BsltoecpkS(itzher(enacdcIldSxh.mxe)m,. cgormomu.pb(ugfrfoSuipz)e,s [ N| C ^~~~~~~~~~~C L_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :{562 : 15| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ warning: initializer order does not match the declaration order [-Wreorder-ctor]| group(group 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | : 687 : 11 :t inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( tid) ,687 | n t h r e a d s ( n tphrriemasd(st)i,d -ttiiddISntBalrotcBkc(atshtr,e andTIhdrxe.axd)s,B cgarsotu,p (&gdrioruepc)t,- > o| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t , | n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u llptr, 563a | r g s - >ssteenpdSbiuzfef(,n cacrlgSsh-m>erme.ccvobmumf.fb,u f f| S ^i zes[N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:C202L:_53P:R Onote: Tin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereO _SI M202P | L E ] / N C C L _RSuTnEWPoSr/ksEilzeemoefn(tT<)F)n ,{ T ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R e d| O group(groupp , Algo, 
Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:t655o:>11(:) .note: rin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu n(we); 655| | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp : 6 : 1 :p rnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested herem s(t i6d | -ItMiPdLS_tCaOrLtLR_eFdUuNcCe(,A lnlTRherdeuacdes,R eCdOuLcLeN,E Tn_uDlIlRpEtCrT,, &SdIiMrPeLcEt,- >Moiunt,, ianrtg3s2-_>ts)e n d| b^u ff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :a391r:g95s:- >note: rexpanded from macro 'IMPL_COLL_FUNC'e cvbuf f391, | | R ^u nWork, 2, 2>::run' requested hereu nc, t y202p | e , F u n c # #RduenvWroerdkoEpl<,F nN,C CTL,_ ARLeGdOO_p#,# aAllggoo,, NPCrCoLt_oP>R(O)T.Or_u#n#(pwreo)t;o > (| ) ^. run(&nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppc:l6S:h1m:e mnote: .in instantiation of member function 'RunWork, 2, 2>::run' requested herew ork); 6\ | I M| P ^L _COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562F:U15N:C (note: Afield 'nthreads' will be initialized after field 'tidInBlock'l lRed u562c | e , C OtLiLdN(EtTi_dD)I,R EnCtTh,r eSaIdMsP(LnEt,h Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h), tid:I562n:B15l:o cwarning: 
kinitializer order does not match the declaration order [-Wreorder-ctor]( threadIdx. x562) | , g r otuipd((gtriodu)p,) ,n t h| r ^~~~~~~~~~~~~~~~~e ads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r60e:a dnote: sfield 'group' will be initialized after field 'stepSize') , ti d562I | n B l o ctki(dt(htrieda)d,I dnxt.hxr)e,a dgsr(onutph(rgeraodusp)),, t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~I n B| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o ck(t h563r | e a d I dsxt.exp)S,i zger(onucpc(lgSrhomuepm).,c o m| m ^~~~~~~~~~~. buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r15e:a dwarning: sinitializer order does not match the declaration order [-Wreorder-ctor]) , tidInBl o562c | k ( t h rteid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppa:7:d1I:d xnote: .in instantiation of member function 'RunWork, 2, 2>::run' requested herex ), group (7g | rIoMuPpL)_,C O L| L ^~~~~~~~~~~~~~~~~_ FUNC(AllRed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:c562e:,60 :C Onote: Lfield 'group' will be initialized after field 'stepSize'L NET_DIR E562C | T , S ItMiPdL(Et,i dM)i,n ,n tuhirneta3d2s_(tn)t h r| e^a ds), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:I391n:B95l:o cnote: kexpanded from macro 'IMPL_COLL_FUNC'( thread I391d | x . 
xR)u,n Wgorroku, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, M/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:n562,: 15u:i nwarning: tinitializer order does not match the declaration order [-Wreorder-ctor]6 4_t) | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i391d:(95t:i dnote: )expanded from macro 'IMPL_COLL_FUNC', nthre a391d | s ( nRtuhnrWeoardks<)n,c ctliFduInncB#l#ofcukn(ct,h rteyapdeI,d xF.uxn)c,# #gdreovurpe(dgorpo ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~N C C| L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ ALGO _563# | # a l g os,t eNpCSCiLz_eP(RnOcTcOl_S#h#mpermo.tcoo>m(m)..bruufnf(S&inzcecsl[SNhCmCeLm_.PwRoOrTkO)_;S I\M P L| E ^] /NCCL_ST/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:P562S:/15s:i znote: efield 'nthreads' will be initialized after field 'tidInBlock'o f(T)) {562 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t| i group(groupd (tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r626e:a9d:s (note: nin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hread 
s626) | , t i d I n B lporcikm(st(htrieda-dtIiddxS.txa)r,t Sgcraotutpe(rg,r onuTph)r,e a d| s ^~~~~~~~~~~~~~~~~S catt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:r562,: 60N:U Lnote: Lfield 'group' will be initialized after field 'stepSize', direc t562- | > u p , tairdg(st-i>ds)e,n dnbtuhfrfe,a dasr(gnst-h>rreeacdvsb)u,f ft,i d I| n ^B lock(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:I dnote: xin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here. x), g202r | o u p ( g r o u pR)u,n W o| r ^~~~~~~~~~~k Element().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hrk:)562;: 15\: warning: | initializer order does not match the declaration order [-Wreorder-ctor] ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :56215 | : note: field 'nthreads' will be initialized after field 'tidInBlock' tid(tid), 562n | t h r e atdisd((nttihdr)e,a dnst)h,r etaiddsI(nnBtlhorceka(dtsh)r,e atdiIddIxn.Bxl)o,c kg(rtohurpe(agdrIoduxp.)x,) , | g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o u| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( grou p563) | , | ^~~~~~~~~~~~~~~~~s tepS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:z562e:(60n:c cnote: lfield 'group' will be initialized after field 'stepSize'S hmem. c562o | m m . b utfifdS(itzieds)[,N CnCtLh_rPeRaOdTsO(_nStIhMrPeLaEd]s/)N,C CtLi_dSITnEBPlSo/cski(ztehorfe(aTd)I)d x{. 
x )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ g r| o group(groupu p(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h : 641| : ^~~~~~~~~~~11 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: em.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562t:,15 :a rwarning: ginitializer order does not match the declaration order [-Wreorder-ctor]s ->sendbuf f562, | a r g st-i>dr(etcivdb)u,f fn,t h r| e ^a ds(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s202):,53 :t inote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereI nBloc k202( | t h r e a d I d xR.uxn)W,o rgkrEoluepm(egnrtoe(p)S.irzuen((nwcec)l;S h m| e ^m .comm./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppb:u7f:f1S:i znote: ein instantiation of member function 'RunWork, 2, 2>::run' requested heres [NCC L7_ | PIRMOPTLO__CSOILMLP_LFEU]N/CN(CAClLl_RSeTdEuPcSe/,s iCzOeLoLfN(ETT)_)D I{R E C| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, S| I group(groupM PLE, 
Min/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 641u:i11n:t 3note: 2in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here_ t) | ^ 641 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95 : note: pexpanded from macro 'IMPL_COLL_FUNC'r ims(tid -391t | i d SRtuanrWtoRrekdddeovwrne,d o&pd-,> oNuCtC,L _aArLgGsO-_>#s#eanldgbou,f fN,C CaLr_gPsR-O>TrOe_c#v#bpurfoft,o > (| ) ^. run(&/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:c202c:l53S:h mnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herem .wor k202) | ; \ | ^ RunWo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562E:l15e:m enote: nfield 'nthreads' will be initialized after field 'tidInBlock't e(a)d.sr(unnt(hwree)a;d s )| , ^ tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppn:B8l:o1c:k (note: tin instantiation of member function 'RunWork, 2, 2>::run' requested hereh read I8d | xI.MxP)L,_ CgOrLoLu_pF(UgNrCo(uApl)l,R e d| u ^~~~~~~~~~~~~~~~~c e, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:O562L:L60N:E Tnote: _field 'group' will be initialized after field 'stepSize'D IRECT ,562 | S I M P LtEi,d (Mtiind,) ,i nntt6h4r_eta)d s (| n^t hrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:s391):,95 :t inote: dexpanded from macro 'IMPL_COLL_FUNC'I nBlock (391t | h r eRaudnIWdoxr.kx<)n,c cglrFouunpc(#g#rfouunpc),, t y| p ^~~~~~~~~~~e , Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ L_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 
10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:#562p:r15o:t owarning: >initializer order does not match the declaration order [-Wreorder-ctor]( ).run(&ncc l562S | h m e m .twiodr(kt)i;d )\, n| t ^h reads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:h562r:e15a:d snote: )field 'nthreads' will be initialized after field 'tidInBlock', tidIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n Block (563t | h r e a dsItdexp.Sxi)z,e (gnrcoculpS(hgmreomu.pc)o,m m .| b ^~~~~~~~~~~~~~~~~u ff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:i562z:e60s:[ Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_PR O562T | O _ S I MtPiLdE(]t/iNdC)C,L _nStThErPeSa/dssi(znetohfr(eTa)d)s ){, t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d I n| B group(groupl ock(threadI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:x655.:x11):, note: gin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer oup(gr o655u | p ) , | ^~~~~~~~~~~ prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hlShmem:.562c:o15m:m .warning: binitializer order does not match the declaration order [-Wreorder-ctor]u ffSizes[NCCL_PR O562T | O _ S I MtPiLdE(]t/iNdC)C,L _nStThErPeSa/dssi(znetohfr(eTa)d)s ){, t| i ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d I n| B group(groupl ock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a687d:I11d:x .note: xin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , gro u687p | ( g r o u p ) , | p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r i m| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( tid-t i563d | S t a r tsBtceapsSti,z en(TnhcrcelaSdhsmBecma.scto,m m&.dbiurfefcSti-z>eosu[tN,C CnLu_lPlRpOtTrO,_ SaIrMgPsL-E>]s/eNnCdCbLu_fSfT,E PaSr/gssi-z>eroefc(vTb)u)f f{, | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ^ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h202: | 641 : 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here RunWo r641k | E l e m 
e n t < F n ,p rTi,m sR(etdiOdp-,t iAdlSgtoa,r tPRreodtuoc>e(,) .nrTuhnr(ewaed)s;R e d| u ^c e, d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppi:r8e:c1t:- >note: din instantiation of member function 'RunWork, 2, 2>::run' requested hereo wn, 8& | dIiMrPeLc_tC-O>LoLu_tF,U NaCr(gAsl-l>Rseednudcbeu,f fC,O LaLrNgEsT-_>DrIeRcEvCbTu,f fS,I M P| L ^E , Min, i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t2026:453_:t )note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | ^ 202/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h | : 391 : 95 : note: expanded from macro 'IMPL_COLL_FUNC' RunWo r391k | E l eRmuennWtou(n)c.#r#udne(vwree)d;o p <| t ^y pe>, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppC:C9L:_1A:L Gnote: Oin instantiation of member function 'RunWork, 2, 2>::run' requested here_ ##al g9o | ,I MNPCLC_LC_OPLRLO_TFOU_N#C#(pArloltRoe>d(u)c.er,u nC(O&LnLcNcElTS_hDmIeRmE.CwTo,r kS)I;M P\L E ,| ^M in,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :u562i:n15t:6 4note: _field 'nthreads' will be initialized after field 'tidInBlock't ) | ^562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t391i:d95(:t inote: dexpanded from macro 'IMPL_COLL_FUNC') , nthr e391a | d s (RnutnhWroerakd ,| : ^~~~~~~~~~~~~~~~~562N :C15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC::L 562_warning: :Ainitializer order does not match the declaration order [-Wreorder-ctor]60L :G Onote: _field 'group' will be initialized after field 'stepSize'# #algo ,562 | N C C562 L | _ tP iR dO (TtOti_id#d(#)tp,ir don)tt,oh >rn(et)ah.drrseu(ann(dt&snh(crncetlahSdrhsem)ae,dm .stw)io,dr Iktn)iB;dl Io\nc Bk l| (o ^tc 
hkr(etahdrIe/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hda:xd562.:Ix15d:)x ,.note: field 'nthreads' will be initialized after field 'tidInBlock'xg )r,o ugpr (o562gu | rp o( ug pr )ot,ui pd )(| ,t ^~~~~~~~~~~ i d| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, n| t tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)h reads (563n | t h r e asdtse)p,S itzied(InncBclloSchkm(etmh.rceoamdmI.dbxu.fxf)S,i zgerso[uNpC(CgLr_oPuRpO)T,O _ S| I ^~~~~~~~~~~~~~~~~M PLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h]:/562N:C60C:L _note: Sfield 'group' will be initialized after field 'stepSize'T EPS/ s562i | z e o f (tTi)d)( t{i d )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ n t| h group(groupr eads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 626t:i9d:I nnote: Bin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel ock(th r626e | a d I d x . 
x ) ,p rgirmosu(pt(igdr-otuipd)S,t a r| t ^~~~~~~~~~~S catter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:S562c:a15t:t ewarning: rinitializer order does not match the declaration order [-Wreorder-ctor], NULL, di r562e | c t - > utpi,d (atrigds)-,> snetnhdrbeuafdfs,( natrhgrse-a>drse)c,v btuifdfI,n B l| o ^c k(thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:d202x:.53x:) ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereg roup( g202r | o u p ) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~R u n| W tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)o rkEle m563e | n t < F ns,t eTp,S iRzeed(Onpc,c lASlhgmoe,m .Pcroomtmo.>b(u)f.frSuinz(ewse[)N;C C L| _ ^P ROTO_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppI:M9P:L1E:] /note: Nin instantiation of member function 'RunWork, 2, 2>::run' requested hereC CL_S T9E | PISM/PsLi_zCeOoLfL(_TF)U)N C{( A l| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R e d| u group(groupc e, COLLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hE:T626_:D9I:R Enote: Cin instantiation of 
member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereT , SIMP L626E | , M i n , u ipnrti6m4s_(tt)i d -| t^i dStar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:S391c:a95t:t enote: rexpanded from macro 'IMPL_COLL_FUNC', nThrea d391s | S c aRtutneWro,r kNnucp,, tayrpges,- >Fsuenncd#b#udfefv,r eadrogps<-t>yrpeec>v,b uNfCfC,L _ A| L ^G O_##a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:g202o:,53 :N Cnote: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereL _PRO T202O | _ # # p r o t o >R(u)n.Wrournk(E&lnecmcelnSth:(15):. rnote: ufield 'nthreads' will be initialized after field 'tidInBlock'n (we); 562 | | ^ tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp(:t8i:d1):, note: nin instantiation of member function 'RunWork, 2, 2>::run' requested heret hrea d8s | (InMtP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hhL:r_562eC:aO15dL:sL )_warning: ,Finitializer order does not match the declaration order [-Wreorder-ctor] U tNiCd(IAnlBllRoec dk562u( | ct eh ,r e CatOdiLIdLd(NxtEi.Tdx_))D,,I RngEtrChoTru,ep a(SdgIsrM(oPnuLtpEh),r, e Ma id| ns ^~~~~~~~~~~~~~~~~,) ,i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h n:tt562i6:d460I_:nt B)note: l field 'group' will be initialized after field 'stepSize'o c | k^( t562h | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr :e 391a :dIdx 95.t:xi )dnote: ,(expanded from macro 'IMPL_COLL_FUNC' t girdo )u391,p | ( ng trRhourunepWa)od,rs k( .g,br uoNfuCfpCS)Li,_z Ae Ls| G[ ^~~~~~~~~~~ON _C#C#La_lPgRoO,T ON_CSCILM_PPLREO]T/ON_C#C#Lp_rSoTtEoP>S(/)s.irzueno(f&(nTc)c)l S{h m e| m ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~. 
w o| r group(groupk ); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h641::56211::15 :note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herenote: field 'nthreads' will be initialized after field 'tidInBlock' 641 | 562 | t i d ( tpirdi)m,s (nttihdr-etaiddsS(tnatrhtrReeaddusc)e,, tniTdhIrneBaldoscRke(dtuhcree,a ddIidrxe.cxt)-,> dgorwonu,p (&gdrioruepc)t,- > o| u ^~~~~~~~~~~~~~~~~t , a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:g562s:-60>:s enote: nfield 'group' will be initialized after field 'stepSize'd buff, 562a | r g s - >triedc(vtbiudf)f,, n t| h ^r eads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e202a:d53s:) ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret idIn B202l | o c k ( t h r e aRduIndWxo.rxk)E,l egmreonutp<(Fgnr,o uT, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :g15r:o uwarning: pinitializer order does not match the declaration order [-Wreorder-ctor]( group), 562 | | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->p), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: 
in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, 
nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, 
uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hfield 'nthreads' will be initialized after field 'tidInBlock' :562 :56215 | : warning: initializer order does not match the declaration order [-Wreorder-ctor] tid(tid), nth r562e | a d s ( tid(tnitdh)r,e andtsh)r,e atdisd(InntBhlroecakd(st)h,r etaiddIIdnxB.lxo)c,k (gtrhoruepa(dgIrdoxu.px)),, g| 
r ^~~~~~~~~~~~~~~~~o up(g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:o562u:p60):, note: field 'group' will be initialized after field 'stepSize'| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 562 | 563t | i d ( t isdt)e,p Snitzher(enacdcsl(Snhtmherme.acdosm)m,. btuifdfISniBzleosc[kN(CtChLr_ePaRdOITdOx_.SxI)M,P LgEr]o/uNpC(CgLr_oSuTpE)P,S / s| i ^~~~~~~~~~~z eof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 
9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| ^~~~~~~~~~~ :562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ : warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllRedtr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(tuce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, 
direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: 
in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) 
{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562C:O15L:L _warning: Finitializer order does not match the declaration order [-Wreorder-ctor]U NC(AllReduce ,562 | C O L L NtEiTd_(DtIiRdE)C,T ,n tShIrMePaLdEs,( nMtihnr,e audisn)t,6 4t_itd)I n B| l^o ck(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h391r:e95a:d Inote: dexpanded from macro 'IMPL_COLL_FUNC'x .x), g391r | o u pR(ugnrWoourpk)<,n c c| l ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~F u n| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# #func ,563 | t y p e ,s tFeupnSci#z#ed(envcrceldSohpmm,m .NbCuCfLf_SAiLzGeOs_[#N#CaClLg_oP,R 
ONTCOC_LS_IPMRPOLTEO]_/#N#CpCrLo_tSoT>E(P)S./rsuinz(e&onfc(cTl)S)h m{e m .| w ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~o r k| ) group(group; \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h15::666 :note: 9field 'nthreads' will be initialized after field 'tidInBlock': note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562 | 666 | t i d ( t i d ) ,p rnitmhsr(etaidds,( nntThhrreeaaddss)G,a tthiedrI,n Bdliorcekc(tt-h>ruepa,d INdUxL.Lx,) ,a rggrso-u>ps(egnrdobuupf)f,, a| r ^~~~~~~~~~~~~~~~~g s->r/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:c562v:b60u:f fnote: ,field 'group' will be initialized after field 'stepSize' | ^ 562 | tid(tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:s (note: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret hread s202) | , t i d I n B lRoucnkW(threadoIrdkxE.lxe)m,e ngtr().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: 
in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:(562n:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d s), tidIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t i d| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n Block( t563h | r e a d Isdtxe.pxS)i,z eg(rnocucpl(Sghrmoeump.)c,o m m| . 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~b u f| f tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S izes [563N | C C L _ PsRtOeTpOS_iSzIeM(PnLcEc]l/SNhCmCeLm_.ScToEmPmS./bsuifzfeSoifz(eTs)[)N C{C L _| P ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~R O T| O group(group_ SIMPLE]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:S655T:E11P:S /note: sin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei zeof(T )655) | { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group prims(tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h-:t677i:d11S:t anote: rin instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret Reduc e677, | n T h r e a d s R epdruicmes,( tniudl-ltpitdrS,t a&rdtiBrceacstt-,> onuTth,r eaardgssB-c>assetn,d b&udfifr,e catr-g>so-u>tr,e cdvibruefcft,- > d| o ^w n, args->s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:n202d:b53u:f fnote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here args- >202r | e c v b u f f , R u| n ^W orkEl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:m202e:n53t:< Fnote: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here, T, 202R | e d O p , A l gRou,n WPorroktEol>e(m)e.nrtunote: (in instantiation of member function 'RunWork, 2, 2>::run' requested here) .run( w9e | )I;M P L| _ ^C OLL_F/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppU:N10C:(1A:l lnote: Rin instantiation of member function 'RunWork, 2, 2>::run' requested heree duc e10, | 
ICMOPLLL_NCEOTL_LD_IFRUENCCT(,A lSlIRMePdLuEc,e ,M iCnO,L LuNiEnTt_6D4I_RtE)C T ,| ^S IMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391M:i95n:, note: hexpanded from macro 'IMPL_COLL_FUNC'a lf) | ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :R391u:n95W:o rnote: kexpanded from macro 'IMPL_COLL_FUNC'< ncclF u391n | c # #RfuunnWco,r ktn,c #N#CdCeLv_rAeLdGoOp_<#t#yapleg>o,, NNCCCCLL__APLRGOOT_O#_##a#lpgroo,t oN>C(C)L._rPuRnO(T&On_c#c#lpSrhomteom>.(w)o.rrku)n;( &\n c c| l ^S hmem.w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:r562k:)15;: \note: field 'nthreads' will be initialized after field 'tidInBlock' | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562: | 562 : 15 : tnote: ifield 'nthreads' will be initialized after field 'tidInBlock'd (tid )562, | n t h rteiadd(st(indt)h,r enatdhsr)e,a dtsi(dnItnhBrleoacdks()t,h rteiaddIIndBxl.oxc)k,( tghrroeuapd(Igdrxo.uxp)),, g r| o ^~~~~~~~~~~~~~~~~u p(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ho:u562p:)60,: note: | field 'group' will be initialized after field 'stepSize' ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562: 60562: | note: field 'group' will be initialized after field 'stepSize' tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhrreeaaddss)(,n tthirdeIandBsl)o,c kt(itdhIrneBaldoIcdkx(.txh)r,e agdrIoduxp.(xg)r,o ugpr)o,u p (| g ^~~~~~~~~~~r oup), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' 
requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, 
&direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in 
instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ educe, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hMi:n562,: 15h:a lwarning: finitializer order does not match the declaration order [-Wreorder-ctor]) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :562391 | : 95 : note: texpanded from macro 'IMPL_COLL_FUNC'i d(tid) ,391 | n t hRruenaWdosr(knr,o uNpC)C,L _ A| L ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~G O _| # tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)# algo ,563 | N C C L _sPtReOpTSOi_z#e#(pnrcoctloS>h(m)e.mr.ucno(m&mn.cbculfSfhSmiezme.sw[oNrCkC)L;_ P\R O T| O ^_ SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:]562/:N15C:C Lnote: _field 'nthreads' will be initialized after field 'tidInBlock'S TEPS/s i562z | e o f ( Tt)i)d ({t i d| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, n| t group(grouph reads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r666e:a9d:s )note: ,in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tidI n666B | l o c k ( t h r epardiImdsx(.txi)d,, gnrTohurpe(agdrsoGuapt)h,e r ,| ^~~~~~~~~~~~~~~~~d ire/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:t562-:>60u:p ,note: field 'group' will be initialized after field 'stepSize'N ULL, a562r | g s - > steindd(btuifdf),, anrtghsr-e>ardesc(vnbtuhfrfe,a d s| ) ^, tidInB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hl:o202c:k53(:t hnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree adId x202. 
| x ) , g r o u pR(ugnrWoourpk)E,l e m| e ^~~~~~~~~~~n t().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :687562 | : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] pri m562s | ( t i d -ttiidd(Sttiadr)t,B cnatshtr,e andTsh(rnetahdrseBacdass)t,, t&iddiIrneBclto-c>ko(utth,r enaudlIldpxt.rx,) ,a rggrso-u>ps(egnrdobuupf)f,, a| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~g s -| > tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r ecvbu f563f | , | ^s tepSize(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:c202l:S53h:m enote: min instantiation of member function 'RunWorkElement, 2, 2>::run' requested here. comm .202b | u f f S i z e s [RNuCnCWLo_rPkREOlTeOm_eSnItM ({) . r| u ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n ( w| e group(group) ; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:: 11note: :in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 666 | 11 | I M P Lp_rCiOmLsL(_tFiUdN,C (nATlhlrReeadduscGea,t hCeOrL,L NdEiTr_eDcItR-E>CuTp,, SNIUMLPLL,E ,a rMgisn-,> sfelnodabtu)f | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ f, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 
'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member 
function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorke,a dNsC(CnLt_hArLeGaOd_s#)#,a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h l:tg562io:d,15I :nN BCwarning: 
lCinitializer order does not match the declaration order [-Wreorder-ctor]oL c_kP(RtOh Tr562Oe | _a #d #I pd rxto.itxdo)(>,t( i)gd.r)ro,uu npn((t&ghnrrcoecualpdS)sh,(m ne tm| h. ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~rw eo ar| dk tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)s) );, \t i563 d | | I ^n B l o/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hsc:tk562e(:pt15Sh:ir zenote: eafield 'nthreads' will be initialized after field 'tidInBlock'(d nIcdcx l.562Sx | h) m, e mg .rtcoioudmp(m(t.gibrduo)fu,fp S)ni,tz he rs| e[ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~aN dC sC| (L tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n_ tPhRrO eT563aO | d_ sS )I ,M PstLtiEed]pI/SnNiBCzlCeoL(c_nkSc(TctElhPSrShe/masedimIz.decxoo.fmx(m)T.,)b )ug fr{fo Su ip| z( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eg sr [o| Nu group(groupCp C)L,_ P R| O/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h ^~~~~~~~~~~~~~~~~T: O677_:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS11:I:562M :Pnote: 60Lin instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here:E ]note: /field 'group' will be initialized after field 'stepSize'N C C677L | _ 562S | T E P S / ts ii dz (eptorifid(m)Ts,)( )tn it{dh -triedaSdtsa(rnttBhcraesatd,s )n,T htriedaIdnsBBlcoacskt(,t h&rdeiardeIcdtx-.>xo)u,t ,g rdoiurpe(cgtr-o>udpo)w,n , | a ^~~~~~~~~~~r gs->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ds), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, di/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rect->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | 
IMPL_COLL_FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:A562l:l15R:e dwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]c e, COLLN E562T | _ D I R EtCiTd,( tSiIdM)P,L En,t hMriena,d sh(anltfh)r e a| d^s ), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:I391n:B95l:o cnote: kexpanded from macro 'IMPL_COLL_FUNC'( threadId x391. | x ) ,R ugnrWoourpk(e,m .NcCoCmLm_.AbLuGfOf_S#i#zaelsg[oN,C CNLC_CPLR_OPTROO_TSOI_M#P#LpEr]o/tNoC>C(L)_.SrTuEnP(S&/nsciczleSohfm(eTm).)w o{r k )| ; ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ \ | group(group| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h::562677::1511:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 562677 | | t i d ( t i dp)r,i mnst(htrieda-dtsi(dnSttharretaBdcsa)s,t ,t indTIhnrBelaodcskB(ctahsrte,a d&Iddixr.exc)t,- >goruotu,p (dgirroeucpt)-,> d o| w ^~~~~~~~~~~~~~~~~n , arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:-562>:s60e:n dnote: bfield 'group' will be initialized after field 'stepSize'u ff, arg s562- | > r e c vtbiudf(ft,i d )| , ^ nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d202s:(53n:t hnote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heree ads )202, | t i d I n B l oRcukn(WtohrrkeEaldeImdexn.tx<)Fn, T, R,e dgOrpo,u pA(lggroo,u pP)r,o t o| > ^~~~~~~~~~~( ).run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 
2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:i562d:(15t:i dwarning: )initializer order does not match the declaration order [-Wreorder-ctor], nthreads (562n | t h r e atdisd)(,t itdi)d,I nnBtlhorceka(dtsh(rnetahdrIedaxd.sx)),, tgirdoIunpB(lgorcoku(pt)h,r e a| d ^~~~~~~~~~~~~~~~~I dx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,60 :g rnote: ofield 'group' will be initialized after field 'stepSize'u p(gro u562p | ) , | t 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i d (| tid), n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t hreads (563n | t h r e asdtse)p,S itzied(InncBclloSchkm(etmh.rceoamdmI.dbxu.fxf)S,i zgerso[uNpC(CgLr_oPuRpO)T,O _ S| I ^~~~~~~~~~~M PLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElementwarning: (initializer order does not match the declaration order [-Wreorder-ctor]) .run(we); 562| | ^ t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppi:d12(:t1i:d )note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here nth r12e | aIdMsP(Ln_tChOrLeLa_dFsU)N,C (tAildlIRneBdluoccek,( tChOrLeLaNdEITd_xD.IxR)E,C Tg,r oSuIpM(PgLrEo,u pM)i,n , | d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o u b| l tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)e ) | ^ 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 : 95s:t enote: pexpanded from macro 'IMPL_COLL_FUNC'S ize(n c391c | l S hmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:(562n:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d s), tid I562n | B l o c kt(itdh(rteiadd)I,d xn.txh)r,e agdrso(unpt(hgrreoaudps)),, 
t| i ^~~~~~~~~~~d InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FRunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>()U.run(N&Cn(cAclllSRhemdeumce, .CwOoLrLkN)E;T _\D I R| E ^C T, SIMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:E562,: 15M:i nnote: ,field 'nthreads' will be initialized after field 'tidInBlock' rccl_ b562f | l o a t 1t6i)d ( t| i^d ), 
nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:t391h:r95e:a dnote: sexpanded from macro 'IMPL_COLL_FUNC') , tidIn B391l | o c kR(utnhWroerakd, 562N | C C L _ AtLiGdO(_t#i#da)l,g on,t hNrCeCaLd_sP(RnOtThOr_e#a#dpsr)o,t ot>i(d)I.nrBulno(c&kn(ctchlrSehamdeImd.xw.oxr)k,) ;g r\o u p| ( ^g roup), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:u562n:(15w:e )warning: ;initializer order does not match the declaration order [-Wreorder-ctor] | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp562: | 12 : 1 : tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested hered (tid )12, | InMtPhLr_eCaOdLsL(_nFtUhNrCe(aAdlsl)R,e dtuicdeI,n BClOoLcLkN(EtTh_rDeIaRdEICdTx,. xS)I,M PgLrEo,u pM(ignr,o udpo)u,b l e| ) ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 391563: | 95 : note: expanded from macro 'IMPL_COLL_FUNC's tepSiz e391( | n c cRluSnhWmoermk.P,S /NsCiCzLe_oAfL(GTO)_)# #{a l g| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, N| C group(groupC L_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hO:_677#:#11p:r onote: tin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereo >().r u677n | ( & n c c l S h m e mp.rwiomrsk()t;i d\- t i| d ^S tartB/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:a562s:t15,: nnote: Tfield 'nthreads' will be initialized after field 'tidInBlock'h reads B562c | a s t , t&iddi(rteicdt)-,> 
onutth,r edaidrse(cntt-h>rdeoawdns,) ,a rtgisd-I>nsBelnodcbku(ftfh,r eaardgIsd-x>.rxe)c,v bgurfofu,p ( g| r ^o up), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^~~~~~~~~~~~~~~~~: 202:53/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here60 : note: field 'group' will be initialized after field 'stepSize' 202 | 562 | RtuindW(otrikdE)l,e mnetnhtr((t)h.rreuand(Iwdex).;x ) ,| ^g roup(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppg:r12o:u1p:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^~~~~~~~~~~ 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(w/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS//usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 
2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:L562_:P15R:O Twarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]_ ##proto>() .562r | u n ( & ntcicdl(Sthimde)m,. 
wnotrhkr)e;a d\s ( n| t ^h reads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :t15i:d Inote: nfield 'nthreads' will be initialized after field 'tidInBlock'B lock( t562h | r e a d Itdixd.(xt)i,d )g,r onutph(rgeraodusp()n,t h r| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a d s| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), tidI n563B | l o c k (sttherpeSaidzIed(xn.cxc)l,S hgmreomu.pc(ogmrmo.ubpu)f,f S i| z ^~~~~~~~~~~~~~~~~e s[N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_60P:R Onote: Tfield 'group' will be initialized after field 'stepSize'O _SIM P562L | E ] / N CtCiLd_(StTiEdP)S,/ snitzheroefa(dTs)()n t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d s )| , group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_D IRECTt,i dSIInMBPlLoEc,k (Mtihnr,e ardcIcdlx_.bxf)l,o agtr1o6u)p ( g| r^o up),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391| : ^~~~~~~~~~~95 : note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15::562 :warning: 15initializer order does not match the declaration order [-Wreorder-ctor]: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 563 | s t esptSeipzSei(znec(cnlcSchlmSehmm.ecmo.mcmo.mbmu.fbfuSfifzSeisz[eNsC[CNLC_CPLR_OPTROO_TSOI_MSPILMEP]L/EN]C/CNLC_CSLT_ESPTSE/PsSi/zseiozfe(oTf)()T ){) {| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| group(group | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h655::68711::11 :note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herenote: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | 687 | p rpirmism(st(itdi-dt-itdiSdtSatratrRteBdcuacset,, nnTThhrreeaaddssRBecdauscte,, &nduilrlepcttr-,> o&udti,r encutl-l>poturt,, aarrggss-->>sseennddbbuuffff,, aarrggss-->>rreeccvvbbuuffff,, | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::202202::5353:: note: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herein instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202202 | | RRuunnWWoorrkkEElleemmeenntt<>(())..rruunn((wwee));; | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp13::131::1 :note: in instantiation of member function 'RunWork, 2, 2>::run' requested herenote: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | 13I | MIPMLP_LC_OCLOLL_LF_UFNUCN(CA(lAllRleRdeudcuec,e ,C OCLOLLNLENTE_TD_IDRIERCETC,T ,S ISMIPMLPEL,E ,M iMni,n ,r crcclc_lb_fblfolaota1t61)6 ) | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::391391::9595:: note: note: expanded from macro 'IMPL_COLL_FUNC'expanded from macro 'IMPL_COLL_FUNC' 391391 | | RRuunnWWoorrkk<>,, NNCCCCLL__AALLGGOO__####aallggoo,, NNCCCCLL__PPRROOTTOO__####pprroottoo>>(())..rruunn((&&nnccccllSShhmmeemm..wwoorrkk));; \\ | | ^ ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562::56215::15 :note: field 'nthreads' will be initialized after field 'tidInBlock'note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::60562:: 60note: :field 'group' will be initialized after field 'stepSize' note: field 'group' will be initialized after field 'stepSize' 562 | 562 | t i d (ttiidd()t,i dn)t,h rnetahdrse(andtsh(rnetahdrse)a,d st)i,d ItniBdlIoncBkl(otchkr(etahdrIedaxd.Ixd)x,. xg)r,o ugpr(ogurpo(ugpr)o,u p )| , ^~~~~~~~~~~ | ^~~~~~~~~~~ oto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here562 :15: 202warning: | initializer order does not match the declaration order [-Wreorder-ctor] RunWor k562E | l e m e ntti,( )t.irduInn(Bwleo)c;k ( t| h ^r 
eadId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppx:.12x:)1,: gnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested hereo up(g r12o | uIpM)P,L _ C| O ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~L L _| F tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)UN C(AllR e563d | u c e , sCtOeLpLSNiEzTe_(DnIcRcElShmemC.Tc,o mSmI.MbPuLfEf,S iMziens,[ NdCoCuLb_lPeR)O T O| _^S IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:E391]:/95N:C Cnote: Lexpanded from macro 'IMPL_COLL_FUNC'_ STEPS /391s | i z eRoufn(WTo)r)k <{n c c| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~F u n| c group(group# #func, type/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 677F:u11n:c #note: #in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered evredo p677< | t y p e > , N C C Lp_rAiLmGsO(_t#i#da-ltgiod,S tNaCrCtLB_cPaRsOtT,O _n#T#hprreoatdos>B(c)a.srtu,n (&&dnicrcelcSth-m>eomu.tw,o rdki)r;e c\t - >| d ^o wn, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ha:r562g:s15-:> snote: efield 'nthreads' will be initialized after field 'tidInBlock'n dbuf f562, | a r g st-i>dr(etcivdb)u,f fn,t h r| e ^a ds(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d202s:)53,: tnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered InBl o202c | k ( t h r e a d IRduxn.Wxo)r,k Eglreomuepn(tg562( | ) . 
r u nt(iwde()t;i d )| , ^ nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpps:(13n:t1h:r enote: ain instantiation of member function 'RunWork, 2, 2>::run' requested hered s), t i13d | IInMBPlLo_cCkO(LtLh_rFeUaNdCI(dAxl.lxR)e,d ugcreo,u pC(OgLrLoNuEpT)_,D I R| E ^~~~~~~~~~~C T, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, 
direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: 
in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: 
in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidSta/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:tB562c:a15s:t ,warning: initializer order does not match the declaration order [-Wreorder-ctor]n ThreadsBcas t562, | & d i rteicdt(-t>iodu)t,, ndtihrreecatd-s>(dnotwhnr,e aadrsg)s,- >tsiednIdnbBulfofc,k (atrhgrse-a>drIedcxv.bxu)f,f ,g r o| u ^p (group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,202 : 53| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 202 | 563 | R u nsWtoerpkSEilzeem(ennctc_(P)R.OrTuOn_(SwIeM)P;L E ]| / ^N 
CCL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cppT:E13P:S1/:s inote: zin instantiation of member function 'RunWork, 2, 2>::run' requested heree of( T13) | )I M{P L _| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~O L L| _ group(groupF UNC(AllRed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:c626e:,9 :C Onote: Lin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL NET_DIR E626C | T , S I M P L Ep,r iMmisn(,t irdc-ctli_dbSftlaorattS1c6a)t t e| r^, nT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r391e:a95d:s Snote: cexpanded from macro 'IMPL_COLL_FUNC'a tter, 391N | U L LR,u ndWiorrekcculpF,u nacr#g#sf-u>nsce,n dtbyupfef,, Faurngcs#-#>dreevcrvebduofpf<,t y p| e ^> , NCCL_AL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hG:O202_:#53#:a lnote: gin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereo , NCCL _202P | R O T O _ # # p rRoutnoW>o(r)k.Erluenm(e&nntc562(:)15.:r unote: nfield 'nthreads' will be initialized after field 'tidInBlock'( we); 562 | | ^ tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp,: 13n:t1h:r enote: ain instantiation of member function 'RunWork, 2, 2>::run' requested hered s(nthr e13a | dIsM)P,L _tCiOdLILn_BFlUoNcCk((AtlhlrReeadduIcdex,. 
xC)O,L LgNrEoTu_pD(IgRrEoCuTp,) ,S I M| P ^~~~~~~~~~~~~~~~~L E,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :M562i:n60,: rnote: cfield 'group' will be initialized after field 'stepSize'c l_b f562l | o a t 1 6t)i d (| t^i d), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r391e:a95d:s (note: nexpanded from macro 'IMPL_COLL_FUNC't hreads), 391t | i d IRnuBnlWoocrkk(, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rk); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ izes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, )n T h| r^e adsReduce, di/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e391c:t95-:> dnote: oexpanded from macro 'IMPL_COLL_FUNC'w n, &direc t391- | > o uRtu,n Waorrgks<-n>cscelnFdubnucf#f#,f uanrcg,s -t>yrpeec,v bFuufnfc,# # d| e ^v redop:,53 :N Cnote: Cin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereL _ALG O202_ | # # a l g o , NRCuCnLW_oPrRkOETlOe_m#e#nptr (T),. rRuend(O&pn,c cAllSghom,e mP.rwootrok>)(;) .\r u n| ( ^w e); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:: 13note: :field 'nthreads' will be initialized after field 'tidInBlock'1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 562 | 13 | ItMiPdL(_tCiOdL)L,_ FnUtNhCr(eAaldlsR(endtuhcree,a dCsO)L,L NtEiTd_IDnIBRlEoCcTk,( tShIrMePaLdEI,d xM.ixn),, rgcrcolu_pb(fglrooautp1)6,) | | ^~~~~~~~~~~~~~~~~^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562::39160::95 :note: field 'group' will be initialized after field 'stepSize'note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | tRiudn(Wtoirdk)<,n cnctlhFruenacd#s#(fnutnhcr,e atdysp)e,, tFiudnIcn#B#ldoecvkr(etdhorpe.,x )N,C CgLr_oAuLpG(Og_r#o#uapl)g,o , | N ^~~~~~~~~~~C CL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ to>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx941. 67 warnings generated when compiling for gfx940. 67 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member
function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 |
prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork<ncclFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, 
Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx803. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx900. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 82%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- 
--offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: 
unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, 
SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hI:n562B:l15o:c kwarning: (initializer order does not match the declaration order [-Wreorder-ctor]t hreadIdx. 
x562) | , g r otuipd((gtriodu)p,) ,n t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( nthre a563d | s ) , tsitdeIpnSBilzoec(kn(ctchlrSehamdeImd.xc.oxm)m,. bgurfofuSpi(zgerso[uNpC)C,L _ P| R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~O T O| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S IMPLE ]563/ | N C C L _sStTeEpPSSi/zsei(znecocfl(STh)m)e m{. c o| m ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m . b| u group(groupf fSizes[NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hL:_34P:R7O:T Onote: _in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereS IMPLE] /34N | C C L _ S T EpPrSi/mssi(zteiodf,( Tn)t)h r{e a d| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, &| r group(groupi ng->pre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hv:,34 :&7r:i nnote: gin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here- >next, a34r | g s - > s e npdrbiumfsf(,t iadr,g sn-t>hrreecavdbsu,f f&,r ianrgg-s>-p>rreevd,O p&Arrign,g -0>,n eaxrtg,s -a>rcgosn-n>Isnednedxb,u fafr,g sa-r>gcso-n>nrIencdvebxu)f;f , | a ^r gs->re/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.hd:O80p:A5r:g ,note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here0 , a r80g | s - > c ornunnIRnidnegx<,T ,a rRgesd-O>pc,o nPnrIontdoe>x()a;r g s| ) ^; | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :80:5:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :note: 202in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here: 53: note: 80in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here | r202u | n R i n g < T , RRuendWOopr,k EPlreomteon>t(, 1, 2>::run' requested hereo >(). r202u | n ( w e ) ; | R ^u nWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | 
IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | 
prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:eM562u:l15S:u mwarning: ,initializer order does not match the declaration order [-Wreorder-ctor] uint64_t )562 | | ^ tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i391d:)95,: nnote: texpanded from macro 'IMPL_COLL_FUNC'h reads(n t391h | r e aRdusn)W,o rtki| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) NCCL _563A | L G O _ #s#taelpgSoi,z eN(CnCcLc_lPSRhOmTeOm_.#c#opmrmo.tbou>f(f)S.irzuens([&NnCcCcLl_SPhRmOeTmO._wSoIrMkP)L;E ]\/ N C| C ^L _STEP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:/562s:i15z:e onote: ffield 'nthreads' will be initialized after field 'tidInBlock'( T)) { 562| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(groupt id(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h :n34t:h7r:e anote: din instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heres (nthrea d34s | ) , t i d IpnrBilmosc(kt(itdh,r enatdhIrdexa.dxs),, &grrionugp-(>gprroeuvp,) ,& r i| n ^~~~~~~~~~~~~~~~~g 
->n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:x562t:,60 :a rnote: gfield 'group' will be initialized after field 'stepSize's ->se n562d | b u f f ,t iadr(gtsi-d>)r,e cnvtbhurfefa,d sa(rngtsh-r>eraeddsO)p,A rtgi,d I0n,B laorcgks(-t>hcroenandIInddxe.xx,) ,a rggrso-u>pc(ognrnoIunpd)e,x ) ;| ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 
1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in 
instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 
1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Block(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template
specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will 
be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 
1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ [NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, 
SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will 
be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | 202r | u n R i n g < T RunWor,k ERleedmOepn,t ,( aRregdsO)p;, A| l ^g o, Pro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:o202>:(53):. rnote: uin instantiation of member function 'RunWorkElement, 1, 2>::run' requested heren (we) ;202 | | ^ RunWorkElement, 1, 2>::run' requested here, Algo ,10 | PIrMoPtLo_>C(O)L.Lr_uFnU(NwCe()R;e d u| c ^e , RING/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp,: 9S:I1M:P Lnote: Ein instantiation of member function 'RunWork, 1, 2>::run' requested here, Pre M9u | lISMuPmL,_ ChOaLlLf_)F U N| C^( Reduce/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391R:I95N:G ,note: expanded from macro 'IMPL_COLL_FUNC'S IMPLE, 391P | r e MRuulnSWuomr,k r,k <(t)y.preu>n,( &NnCcCcLl_SAhLmGeOm_.#w#oarlkg)o;, \N C C| L ^_ PROTO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h#:p562r:o15t:o >note: (field 'nthreads' will be initialized after field 'tidInBlock') .run(& n562c | c l S h mteimd.(wtoirdk)),; n\t h r| e ^a ds(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:s )note: ,field 'nthreads' will be initialized after field 'tidInBlock' tidIn B562l | o 
c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~~~~~~~t idI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:B562l:o60c:k (note: tfield 'group' will be initialized after field 'stepSize'h read I562d | x . x) , tgirdo(utpi(dg)r,o unpt)h,r e a| d ^~~~~~~~~~~~~~~~~s (nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a60d:s )note: ,field 'group' will be initialized after field 'stepSize' tidIn B562l | o c k ( tthirde(atdiIdd)x,. xn)t,h rgeraodusp((ngtrhoruepa)d,s ) ,| ^~~~~~~~~~~t idInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 
args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1101. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of 
member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:34:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 34 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce.h:80:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<1, 1>>' requested here 80 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(Reduce, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 77%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 
0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: 
note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nc, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in 
instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: 
note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:916:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 1, ProtoSimple<1, 1>, 0>::Primitives' requested here 916 | prims(groupTid, groupNthreads, &recv, &send, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 3, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 3, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_CHAIN, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 
args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, 
args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE 
flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: 
field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | 
prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); 
\ | ^ 7 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 
args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx908. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not 
used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flagIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 83%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable]
386 | int wireOffset = WireWordPerSlice*warp + 2*wid;
| ^
7 warnings generated when compiling for gfx90a.
7 warnings generated when compiling for gfx90a.
7 warnings generated when compiling for gfx1030.
7 warnings generated when compiling for gfx1102.
7 warnings generated when compiling for gfx803.
7 warnings generated when compiling for gfx908.
7 warnings generated when compiling for gfx900.
7 warnings generated when compiling for gfx940.
7 warnings generated when compiling for gfx941.
7 warnings generated when compiling for gfx1101.
7 warnings generated when compiling for gfx1100.
7 warnings generated when compiling for gfx906.
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable]
153 | uint32_t data1, flag1, data2, flag2;
| ^~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable]
153 | uint32_t data1, flag1, data2, flag2;
| ^~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable]
153 | uint32_t data1, flag1, data2, flag2;
| ^~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable]
153 | uint32_t data1, flag1, data2, flag2;
| ^~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable]
153 | uint32_t data1, flag1, data2, flag2;
| ^~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable]
514 | int offset = tid;
| ^
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp:1:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10:
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable]
271 | uint64_t* ptr = recvPtr(0)+ll128Offset;
| ^~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable]
386 | int wireOffset = WireWordPerSlice*warp + 2*wid;
| ^
7 warnings generated when compiling for host.
7 warnings generated when compiling for gfx942.
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
[ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o
/usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1:
In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), 
group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE 
flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization 
'(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o.d -o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 
'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx908. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 
'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings 
generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx803. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset 
= tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 7 warnings generated when compiling for gfx940. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx942. 7 warnings generated when compiling for host. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 84%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx1102. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx1030. 7 warnings generated when compiling for gfx90a. 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx941. 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 85%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT 
CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx940. 7 warnings generated when compiling for gfx906. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE 
flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, 
&ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | 
warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. 7 warnings generated when compiling for gfx1101. 7 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- 
--offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but 
not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | 
int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' 
will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), 
group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from 
macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 92 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx940. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/W7 warnings generated when compiling for gfx1101. 
ARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 
args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE 
flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx900. 7 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoLL128, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:92:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoLL128>' requested here 92 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 1>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:14:1: note: in instantiation of member function 'RunWork, 1, 1>::run' requested here 14 | IMPL_COLL_FUNC(ReduceScatter, RING, LL128, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 7 warnings generated when compiling for gfx908. 7 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 7 warnings generated when compiling for host. 7 warnings generated when compiling for gfx942. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx90a. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 72%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 
--offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will 
be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | 
prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElemen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), t().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: 
in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562h:r15e:a 
dIdxwarning: .initializer order does not match the declaration order [-Wreorder-ctor]x ), group(g r562o | u p ) , t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , nthr e563a | d s ( n tshtreepaSdisz)e,( ntcicdlISnhBmleomc.kc(otmhmr.ebaudfIfdSxi.zxe)s,[ NgCrCoLu_pP(RgOrToOu_pS)I,M P L| E ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~] / N| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)C L_ST E563P | S / s i zsetoefp(STi)z)e ({n c c| l ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~S h m| e group(groupm .comm.buf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hf:S641i:z11e:s [note: Nin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereC CL_PROTO _641S | I M P L E ] / N C C Lp_rSiTmEsP(St/isdi-zteiodfS(tTa)r)t R{e d u| c ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e , | n group(groupT hreadsRed/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:c626e:,9 :d inote: rin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree ct->d o626w | n , & d i r e cptr-i>mosu(tt,i da-rtgisd-S>tsaerntdSbcuaftft,e ra,r gnsT-h>rreeacdvsbSucfaft,t e r| , ^ NULL, dir/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:c202t:-53>:u pnote: ,in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here args -202> | s e n d b u f f ,R uanrWgosr-k>Erleecmvebnutf, 2, 2>::run' requested herer oto> (202) | . 
r u n ( w e ) ;R u n| W ^o rkElemen/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppt:<5F:n1,: Tnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here RedO p5, | IAMlPgLo_,C OPLrLo_tFoU>N(C)(.ArlulnR(ewdeu)c; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will 
be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIREIn file included from CT, SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppM:P1L: EIn file included from ,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :P10r: eIn file included from M/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hu:l167S: 
u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:,562 :u15i:n twarning: 8initializer order does not match the declaration order [-Wreorder-ctor]_ t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 562expanded from macro 'IMPL_COLL_FUNC' | tid (391t | i d )R,u nnWtohrrkeo,u pN(CgCrLo_uApL)G,O _ #| # ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a l g| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), NCCL _563P | R O T O _s#t#epprSoitzoe>((n)c.crluSnh(m&enmc.ccloSmhmm.ebmu.fwfoSrikz)e;s [\N C C| L ^_ PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562S:I15M:P Lnote: Efield 'nthreads' will be initialized after field 'tidInBlock'] /NCCL _562S | T E P S /tsiidz(etoifd()T,) )n t{h r e| a ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d s (| n group(groupt hreads), tidInBlock(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a626d:I9d:x .note: xin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , grou p626( | g r o u p ) , p| r ^~~~~~~~~~~~~~~~~i ms/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562i:d60-:t inote: dfield 'group' will be initialized after field 'stepSize'S tartS c562a | t t e r ,t indT(htrieda)d,s Snctahtrteeard,s (NnUtLhLr,e addisr)e,c tt-i>duIpn,B laorcgks(-t>hsreenaddbIudfxf.,x )a,r ggsr-o>urpe(cgvrbouufpf),, | | ^ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCdCsLR_eAdLuGcOe_,# #naullglop,t rN,C C&Ld_iPrReOct->out,T Oa_r#g#sp-r>osteon>d(b)u.frfu,n (a&rngcsc-l>Srhemcevmb.uwfofr,k ) ;| ^\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:: 202note: :field 'nthreads' will be initialized after field 'tidInBlock'53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562 | 202t | i d ( t i d ) , RnutnhWroerakdEsl(enmtehnrte.(x)).,r ugnr(owuep)(;g r o| u ^p ), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h4::5621::60 :note: in instantiation of member function 'RunWork, 2, 2>::run' requested herenote: field 'group' will be initialized after field 'stepSize' 4 | 562I | M P L _ CtOiLdL(_tFiUdN)C,( AnltlhRreedaudcse(,n tChOrLeLaNdEsT)_,D ItRiEdCITn,B lSoIcMkP(LtEh,r ePardeIMduxl.Sxu)m,, girnotu8p_(tg)r o u| p^) , | ^~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:O562T:O15_:S Iwarning: Minitializer order does not match the declaration order [-Wreorder-ctor]P LE]/NCC L562_ | S T E P St/isdi(zteiodf)(,T )n)t h{r e a| d ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s ( n| t group(grouph reads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d655I:n11B:l onote: cin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herek (threa d655I | d x . 
x ) , g r o uppr(igmrso(utpi)d,- t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~S t a| r tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t Reduc e563, | n T h rsetaedpsSRiezdeu(cnec,c lnSuhlmlepmt.rc,o m&md.ibruefcftS-i>zoeust[,N CaCrLg_sP-R>OsTeOn_dSbIuMfPfL,E ]a/rNgCsC-L>_rSeTcEvPbSu/fsfi,z e o| f ^( T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202| : group(group53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626 :2029 | : note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here R u626n | W o r k E l e m epnrtih(r)e.ardusnS(cwaet)t;e r ,| ^N ULL, di/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppr:e5c:t1-:> unote: pin instantiation of member function 'RunWork, 2, 2>::run' requested here, args -5> | sIeMnPdLb_uCfOfL,L _aFrUgNsC-(>ArlelcRvebduufcfe,, C| O ^L LNET_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hD:I202R:E53C:T ,note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereS IMPL E202, | P r e M u l S uRmu,n WuoirnktE8l_etm)e n t| <^F n, T,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :R391e:d95O:p ,note: expanded from macro 'IMPL_COLL_FUNC'A lgo, Pr o391t | o > (R)u.nrWuonr(kw, 2, 2>::run' requested here Func #5# | dIeMvPrLe_dCoOpLC,( ANlClCRLe_dAuLcGeO,_ #C#OaLlLgNoE,T _NDCICRLE_CPTR,O TSOI_M#P#LpEr,o tPor>e(M)u.lrSuunm(,& nucicnltS8h_mte)m . 
w| o^r k); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h\: 391 :| 95 ^: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15 :391 | note: field 'nthreads' will be initialized after field 'tidInBlock' RunWor k562< | n c c l Ftuindc(#t#ifdu)n,c ,n tthyrpee, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 
563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.bu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:f562S:i15z:e swarning: [initializer order does not match the declaration order [-Wreorder-ctor]N CCL_PROT O562_ | S I M P LtEi]d/(NtCiCdL)_,S TnEtPhSr/esaidzse(onft(hTr)e)a d{s ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| I group(groupn Block(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a687d:I11d:x .note: xin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here) , gro u687p | ( g r o u p ) , | p ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r i m| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( tid-t i563d | S t a r tsBtceapsSti,z en(TnhcrcelaSdhsmBecma.scto,m m&.dbiurfefcSti-z>eosu[tN,C CnLu_lPlRpOtTrO,_ SaIrMgPsL-E>]s/eNnCdCbLu_fSfT,E PaSr/gssi-z>eroefc(vTb)u)f f{, | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ^ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :note: 677in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here202 | 677 | R u n W o r k E l epmreinmts<(Ftni,d -Tt,i dRSetdaOrpt,B cAalsgto,, nPTrhorteoa>d(s)B.crausnt(,w e&)d;i r e| c ^t ->out/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp,: 4d:i1r:e cnote: 
tin instantiation of member function 'RunWork, 2, 2>::run' requested here- >dow n4, | args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 
'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, 
nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| :^562 :15: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hwarning: :initializer order does not match the declaration order [-Wreorder-ctor]391 :95: note: expanded from macro 'IMPL_COLL_FUNC' 562 | 391 | t iRdu(ntWiodr)k,< nncthrecads(lnFtuhnrce#a#dfsu)n,c ,t itdyIpneB,l oFcukn(ct#h#rdeeavdrIeddxo.px<)t,y pger>o,u pN(CgCrLo_uApL)G,O _ #| # ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a l g| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), NCCL_PR O563T | O _ # # psrtoetpoS>i(z)e.(rnucnc(l&SnhcmcelmS.hcmoemmm..wbourfkf)S;i z\e s [| N ^C CL_PROTO_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:E15]:/ Nnote: Cfield 'nthreads' will be initialized after field 'tidInBlock'C L_STEPS /562s | i z e o ft(iTd)()t i{d ) ,| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n t h| r group(groupe ads(nthreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d655I:n11B:l onote: cin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herek (threa d655I | d x . 
x ) , g r o uppr(igmrso(utpi)d,- t i| d ^~~~~~~~~~~~~~~~~S tart/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:e562d:u60c:e ,note: field 'group' will be initialized after field 'stepSize'n Threa d562s | R e d u ctei,d (ntuildl)p,t rn,t h&rdeiardesc(tn-t>horueta,d sa)r,g st-i>dsIennBdlboucfkf(,t harregasd-I>drxe.cxv)b,u fgfr,o u p| ( ^g roup), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^~~~~~~~~~~: 202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ irect->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h202::56253::15 :note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herewarning: initializer order does not match the declaration order [-Wreorder-ctor] 202 | 562 | RtuindW(otrikdE)l,e mnetnhtr((t)h.rreuand(Iwdex).;x ) ,| ^g roup(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppo:u5p:)1,: note: | in instantiation of member function 'RunWork, 2, 2>::run' requested here ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 5 | IM P563L | _ C O L Ls_tFeUpNSCi(zAel(lnRcecdluSchem,e mC.OcLoLmNmE.Tb_uDfIfRSEiCzTe,s [SNICMCPLL_EP,R OPTrOe_MSuIlMSPuLmE,] /uNiCnCtL8__StT)E P S| /^s izeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:T391):)95 :{ note: expanded from macro 'IMPL_COLL_FUNC'| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group391 | RunWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hk:<687n:c11c:l Fnote: uin instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren c##fu n687c | , t y p e , F u npcr#i#mdse(vtriedd-otpit,B cNaCsCtL,_ AnLTGhOr_e#a#daslBgcoa,s tN,C C&Ld_iPrReOcTtO-_>#o#uptr,o tnou>l(l)p.trru,n (a&rngcsc-l>Sshemnedmb.uwfofr,k )a;r g\s - >| r ^e cvbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:,562 : 15| : ^ note: field 'nthreads' will be initialized after field 'tidInBlock' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562: | 202 : 53 : tnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered (tid )202, | n t h r e a d sR(unntWhorrekaEdlse)m,e nttiu(p)(.grruonu(pw)e,) ; | ^~~~~~~~~~~~~~~~~| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:: 5note: :field 'group' will be initialized after field 'stepSize'1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 562 | 5 | ItMiPdL(_tCiOdL)L,_ FnUtNhCr(eAaldlsR(endtuhcree,a dCsO)L,L NtEiTd_IDnIBRlEoCcTk,( tShIrMePaLdEI,d xP.rxe)M,u lgSruomu,p (ugirnotu8p_)t,) | | ^~~~~~~~~~~^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ OLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: warning: note: initializer order does not match the declaration order [-Wreorder-ctor]field 'nthreads' will be initialized after field 'tidInBlock' 562 | 562 | t itdi(dt(itdi)d,) ,n tnhtrheraedasd(sn(tnhtrheraedasd)s,) ,t itdiIdnIBnlBolcokc(kt(htrheraedaIddIxd.xx.)x,) ,g rgoruopu(pg(rgoruopu)p,) , | ^~~~~~~~~~~~~~~~~| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)562 :60: note: field 'group' will be initialized after field 'stepSize'563 | s562t | e p S i ztei(dn(ctcildS)h,m enmt.hcroemamd.sb(unftfhSriezaedss[)N,C CtLi_dPIRnOBTlOo_cSkI(MtPhLrEe]a/dNICdCxL._xS)T,E PgSr/osuipz(egorfo(uTp))), { | ^~~~~~~~~~~| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ C(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hds), ti:d562I:n15B:l owarning: cinitializer order does not match the declaration order [-Wreorder-ctor]k (threadIdx.x), gr o562u | p ( g r otuipd)(,t i d| ) ^~~~~~~~~~~~~~~~~, 
nthreads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d60s:) ,note: field 'group' will be initialized after field 'stepSize't idInBloc k562( | t h r e atdiIdd(xt.ixd)),, gnrtohurpe(agdrso(unpt)h,r e a| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t idIn B563l | o c k ( tshtreepaSdiIzdex(.nxc)c,l Sghrmoeump.(cgormomu.pb)u,f f S| i ^~~~~~~~~~~z es[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp: 5562: | 1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested heret id(tid )5, | InMtPhLr_eCaOdLsL(_nFtUhNrCe(aAdlsl)R,e dtuicdeI,n BClOoLcLkN(EtTh_rDeIaRdEICdTx,. xS)I,M PgLrEo,u pP(rgerMouulpS)u,m , | u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i n t| 8 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ t) | ^563 | s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:e391p:S95i:z enote: (expanded from macro 'IMPL_COLL_FUNC'n cclShme m391. | c o mRmu.nbWuofrfkST,) )N C{C L _| A ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L G O| _ group(group# #algo, NCCL_P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hR:O677T:O11_:# #note: pin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer oto>() .677r | u n ( & n c c l S h mpermi.mwso(rtki)d;- t\i d S| t ^a rtBcast/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562n:T15h:r enote: afield 'nthreads' will be initialized after field 'tidInBlock'd sBcas t562, | & d i rteicdt(-t>iodu)t,, ndtihrreecatd-s>(dnotwhnr,e aadrsg)s,- >tsiednIdnbBulfofc,k (atrhgrse-a>drIedcxv.bxu)f,f ,g r o| u ^p (group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):,202 : 53| : ^~~~~~~~~~~~~~~~~ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:60 :202 | note: field 'group' will be initialized after field 'stepSize' 562R | u n W o rtkiEdl(etmiedn)t,< Fnnt,h rTe,a 
dRse(dnOtph,r eAaldgso),, PtriodtIon>B(l)o.crku(nt(hwree)a;d I d| x ^. x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppo:u5p:(1g:r onote: uin instantiation of member function 'RunWork, 2, 2>::run' requested herep ), | ^~~~~~~~~~~5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, 
nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer order does not match the declaration order [-Wreorder-ctor]i d(tid) ,562 | n t h r etaidds((tnitdh)r,e andtsh)r,e atdisd(InntBhlroecakd(st)h,r etaiddIIdnxB.lxo)c,k (gtrhoruepa(dgIrdoxu.px)),, g| r ^~~~~~~~~~~o up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:T562_:D15I:R Ewarning: Cinitializer order does not match the declaration order [-Wreorder-ctor]T , SIMPLE, P562r | e M u l Stuimd,( tuiidn)t,8 _ntt)h r e| a^d 
s(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r391e:a95d:s )note: ,expanded from macro 'IMPL_COLL_FUNC' tidInB l391o | c k (RtuhnrWeoardkIe,p SNiCzCeL(_nAcLcGlOS_h#m#eaml.gcoo,m mN.CbCuLf_fPSRiOzTeOs_[#N#CpCrLo_tPoR>O(T)O._rSuInM(P&LnEc]c/lNSChCmLe_mS.TwEoPrSk/)s;i z\e o f| ( ^T )) { /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :| 562 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~: 15 :| group(groupnote: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized 
after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorkd,) ,N CnCtLh_rAeLaGdOs_(#n#tahlrgeoa,d sN)C,C Lt_iPdRIOnTBOl_o#c#kp(rtohtroe>a(d)I.drxu.nx()&,n cgcrloSuhpm(egmr.owuopr)k,) ; | \ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | | ^ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 562563: | 15 : note: field 'nthreads' will be initialized after field 'tidInBlock's tepS i562z | e ( n c ctliSdh(mteimd.)c,o mnmt.hbruefafdSsi(znetsh[rNeCaCdLs_)P,R OtTiOd_ISnIBMlPoLcEk](/tNhCrCeLa_dSITdExP.Sx/)s,i zgeroofu(pT()g)r o{u p )| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | | ^~~~~~~~~~~~~~~~~ group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687: 11562: | note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tid(ti d687) | , n t h r e a d s (pnrtihmrse(atdisd)-,t itdiSdtIanrBtlBoccaks(tt,h rneTahdrIedaxd.sxB)c,a sgtr,o u&pd(igrreocutp-)>,o u t| , ^~~~~~~~~~~ nullptr, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:a562t:t15e:r ,warning: initializer order does not match the declaration order [-Wreorder-ctor]N ULL, direct -562> | u p , atrigds(-t>isde)n,d bnutfhfr,e aadrsg(sn-t>hrreecavdbsu)f,f ,t i d| I ^n Block(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:eadIdx202.:x53):, note: gin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer oup(g r202o | u p ) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ R u| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)W orkEl e563m | e n t < 
Fsnt,e pTS,i zRee(dnOcpc,l SAhlmgeom,. cPormomt.ob>u(f)f.Sriuzne(sw[eN)C;C L _| P ^R OTO_SIM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppP:L7E:]1/:N Cnote: Cin instantiation of member function 'RunWork, 2, 2>::run' requested hereL _STE P7S | /IsMiPzLe_oCfO(LTL)_)F U{N C (| A ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l l R| e group(groupd uce, COLLNET/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h_:D626I:R9E:C Tnote: ,in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here SIMPL E626, | P r e M u l S upmr,i musi(ntti3d2-_tti)d S t| a^r tSca/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:t391e:r95,: nnote: Texpanded from macro 'IMPL_COLL_FUNC'h readsS c391a | t t eRru,n WNoUrLkL<,n cdcilrFeucntc-#>#ufpu,n ca,r gtsy-p>es,e nFdubnucf#f#,d eavrrgesd-o>prf,f ,N C C| L ^_ ALGO_#/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h#:a202l:g53o:, note: Nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereC CL_P R202O | T O _ # # p r o tRou>n(W)o.rrkuEnl(e&mnecnctl562(:)15.:r unote: nfield 'nthreads' will be initialized after field 'tidInBlock'( we); | ^562 | ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppd:(6t:i1d:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested heren thre a6d | sI(MnPtLh_rCeOaLdLs_)F,U NtCi(dAIlnlBRleodcukc(et,h rCeOaLdLINdExT._xD)I,R EgCrTo,u pS(IgMrPoLuEp,) ,P r e| M ^~~~~~~~~~~~~~~~~u lSu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hm:,562 :i60n:t 3note: 2field 'group' will be initialized after field 'stepSize'_ t) | ^562 | 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i391d:(95t:i dnote: )expanded from macro 'IMPL_COLL_FUNC', nthrea d391s | ( n tRhurneWaodrsk)<,n ctcildFIunnBcl#o#cfku(ntch,r etaydpIed,x .Fxu)n,c #g#rdoeuvpr(egdroopu| , ^~~~~~~~~~~ NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ WorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' 
requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, 
nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | .x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ adIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ InBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:t562i:d15):, warning: ninitializer order does not match the declaration order 
[-Wreorder-ctor]t hreads(n t562h | r e a d st)i,d (ttiiddI)n,B lnotchkr(etahdrse(andtIhdrxe.axd)s,) ,g rtoiudpI(ngBrlooucpk)(,t h r| e ^~~~~~~~~~~~~~~~~a dId/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hx:.562x:)60,: gnote: rfield 'group' will be initialized after field 'stepSize'o up(g r562o | u p ) , t i| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( t i| d tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , nt h563r | e a d s (snttehprSeiazdes()n,c ctliSdhImneBml.occokm(mt.hbruefafdSIidzxe.sx[)N,C CgLr_oPuRpO(TgOr_oSuIpM)P,L E ]| / ^~~~~~~~~~~N CCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStamrtRedeumc.ec,o mnmT.hbruefafdSsiRzeedsu[cNeC,C Ld_iPrReOcTtO-_>SdIoMwPnL,E ]&/dNiCrCeLc_tS-T>EoPuSt/,s iazregosf-(>Ts)e)n d{b u f| f ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, a| r group(groupg s->recvbuff, | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :687:11: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: 202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here687 | 202 | p r i mRsu(ntWiodr-ktEildeSmteanrtte(c)t.-r>uonu(tw,e )n;u l l| p ^t r, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpps:-7>:s1e:n dnote: bin instantiation of member function 'RunWork, 2, 2>::run' requested hereu ff, 
7a | rIgMsP-L>_rCeOcLvLb_uFfUfN,C ( A| l ^l Reduce, C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hO:L202L:N53E:T _note: Din instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereI RECT ,202 | S I M P L E , PRruenMWuolrSkuEml,e mueinntt<3F2n_,t )T , | R^e dOp,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :A391l:g95o:, note: Pexpanded from macro 'IMPL_COLL_FUNC'r oto>() .391r | u n (Rwuen)W;o r k| < ^n cclFu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppn:c7#:#1f:u nnote: cin instantiation of member function 'RunWork, 2, 2>::run' requested here, ty p7e | ,I MFPuLn_cC#O#LdLe_vFrUeNdCo(pAu,c eN,C CCLO_LALLNGEOT__#D#IaRlEgCoT,, NSCICMLP_LPER,O TPOr_e#M#uplrSoutmo,> (u)i.nrtu3n2(_&tn)c c l| S^h mem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hw:o391r:k95):; note: \expanded from macro 'IMPL_COLL_FUNC' | ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hR:u562n:W15o:r knote: ,, tNiCdCILn_BAlLoGcOk_(#t#harlegaod,I dNxC.CxL)_,P RgOrToOu_p#(#gprrooutpo)>,( ) .| r ^~~~~~~~~~~~~~~~~u n(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h&:n562c:c60l:S hnote: mfield 'group' will be initialized after field 'stepSize'e m.wo r562k | ) ; \ t i| d ^( tid), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r15e:a dnote: sfield 'nthreads' will be initialized after field 'tidInBlock'( nthre a562d | s ) , ttiiddI(ntBildo)c,k (ntthhrreeaaddIsd(xn.txh)r,e agdrso)u,p (tgirdoIunpB)l,o c k| ( ^~~~~~~~~~~t hreadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement (warning: )initializer order does not match the declaration order [-Wreorder-ctor]. 
run(we); | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :t7i:d1(:t inote: din instantiation of member function 'RunWork, 2, 2>::run' requested here) , nt h7r | eIaMdPsL(_nCtOhLrLe_aFdUsN)C,( AtlildRIendBulcoec,k (CtOhLrLeNaEdTI_dDxI.RxE)C,T ,g rSoIuMpP(LgEr,o uPpr)e,M u l| S ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~u m ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u int32 _563t | ) | ^s tepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hz:e391(:n95c:c lnote: Sexpanded from macro 'IMPL_COLL_FUNC'h mem.com m391. | b u fRfuSniWzoersk[ , | N ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C C L| _ group(groupA LGO_##algo, N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hC:C687L:_11P:R Onote: Tin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereO _##pr o687t | o > ( ) . 
r u n ( & npcrcilmSsh(mteimd.-wtoirdkS)t;a r\t B c| a ^s t, nT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hh:r562e:a15d:s Bnote: cfield 'nthreads' will be initialized after field 'tidInBlock'a st, &d i562r | e c t - >toiudt(,t indu)l,l pnttrh,r eaardgss(-n>tshernedabdusf)f,, tairdgIsn-B>lroecckv(btuhfrfe,a d I| d ^x .x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u202p:(53g:r onote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herep ), | 202 ^~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60R:u nnote: Wfield 'group' will be initialized after field 'stepSize'o rkElem e562n | t < F n ,t iTd,( tRiedd)O,p ,n tAhlrgeoa,d sP(rnotthor>e(a)d.sr)u,n (twied)I;n B l| o ^c k(thr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppe:a5d:I1d:x .note: xin instantiation of member function 'RunWork, 2, 2>::run' requested here) , gr o5u | pI(MgPrLo_uCpO)L,L _ F| U ^~~~~~~~~~~N C(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ up(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation 
of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 
'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, N:CC562L:_15A:L Gwarning: Oinitializer order does not match the declaration order [-Wreorder-ctor]_ ##algo, NCCL_ P562R | O T O _ #t#ipdr(ottiod>)(,) .nrtuhnr(e&andcsc(lnSthhmreema.dwso)r,k )t;i d\I n B| l ^o 
ck(th/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562a:d15I:d xnote: .field 'nthreads' will be initialized after field 'tidInBlock'x ), g r562o | u p ( g rtoiudp()t,i d )| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ n t| h tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r eads( n563t | h r e a dsst)e,p StiizdeI(nnBclcolcSkh(mtehmr.ecaodmImd.xb.uxf)f,S igzreosu[pN(CgCrLo_uPpR)O,T O _| S ^~~~~~~~~~~~~~~~~I MPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h]:/562N:C60C:L _note: Sfield 'group' will be initialized after field 'stepSize'T EPS/s i562z | e o f ( Tt)i)d ({t i d| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, n| t group(grouph reads(nthreads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 655t:i11d:I nnote: Bin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herel ock(t h655r | e a d I d x . 
x ) , pgrriomusp((tgirdo-utpi)d,S t a| r ^~~~~~~~~~~t Reduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args-B>sendcbausftf,, &adrigrse-c>tr-e>covubtu,f fn,u l l| ptr, a ^r gs->send/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hb:u202f:f53,: anote: rin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereg s->r e202c | v b u f f , | R ^u nWorkElement, 2, 2>::run' requested herep , Alg o202, | P r o t o > ( )R.urnuWno(rwkeE)l;e m e| n ^t , 2, 2>::run' requested here Algo ,9 | PIrMoPtLo_>C(O)L.Lr_uFnU(NwCe()A;l l R| e ^d uce, CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppL:L7N:E1T:_ Dnote: Iin instantiation of member function 'RunWork, 2, 2>::run' requested hereR ECT, 7S | IIMMPPLLE_,C OPLrLe_MFuUlNSCu(mA,l luRiendtu6c4e_,t )C O L| L^N ET_D/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:R391E:C95T:, note: Sexpanded from macro 'IMPL_COLL_FUNC'I MPLE, 391P | r e MRuulnSWuomr,k r,k <(t)y.preu>n,( &NnCcCcLl_SAhLmGeOm_.#w#oarlkg)o;, \N C C| L ^_ PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hO:_562#:#15p:r onote: tfield 'nthreads' will be initialized after field 'tidInBlock'o >().r u562n | ( & n c ctliSdh(mteimd.)w,o rnkt)h;r e\a d s| ( ^n threads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h):,562 :t15i:d Inote: nfield 'nthreads' will be initialized after field 
'tidInBlock'B lock(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~~~~~~~e ads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562t:i60d:I nnote: Bfield 'group' will be initialized after field 'stepSize'l ock(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~~~~~~~e ads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562t:i60d:I nnote: Bfield 'group' will be initialized after field 'stepSize'l ock(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~e ads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 
'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h562::20215:: 53warning: :initializer order does not match the declaration order [-Wreorder-ctor] note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202562 | | t i d (RtuindW)o,r knEtlhermeeandts<(Fnnt,h rTe,a dRse)d,O pt,i dAIlngBol,o cPkr(otthor>e(a)d.Irduxn.(xw)e,) ;g r o| u ^p (group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp):,8 : 1| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ note: in instantiation of member function 'RunWork, 2, 2>::run' requested here| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 8 | I563M | P L _ C OsLtLe_pFSUiNzCe((AnlclcRleSdhumceem,. 
cCoOmLmL.NbEuTf_fDSIiRzEeCsT[,N CSCILM_PPLREO,T OP_rSeIMMuPlLSEu]m/,N CiCnLt_6S4T_EtP)S / s| i^z eof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:T391):)95 :{ note: expanded from macro 'IMPL_COLL_FUNC'| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 391 | RunW/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ho:r626k:<9n:c cnote: lFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in 
instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hads(n:t562h:r15e:a dwarning: sinitializer order does not match the declaration order [-Wreorder-ctor]) , tidInBlock(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~~~~~~~e ads)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562t:i60d:I nnote: Bfield 'group' will be initialized after field 'stepSize'l ock(t h562r | e a d I dtxi.dx()t,i dg)r,o unpt(hgrreoaudps)(,n t h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , ti d563I | n B l o cskt(etphSriezaed(Indcxc.lxS)h,m egmr.ocuopm(mg.rbouufpf)S,i z e| s ^~~~~~~~~~~[ NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ _t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :202562 | : 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] RunWo r562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ | kEle me n t s()),. rtuind(IwneB)l;o c k| ( ^t hreadIdx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppu:p8(:g1r:o unote: pin instantiation of member function 'RunWork, 2, 2>::run' requested here) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 8 | | I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)M PL_COLL_F U563N | C ( A l lsRteedpuSciez,e (CnOcLcLlNSEhTm_eDmI.REcCoTm,m .SbIuMfPfLSEi,z ePsr[eNMCuClLS_uPmR,O TiOn_tS6I4M_PtL)E ] /| N^C CL_STE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hP:S391/:s95i:z enote: oexpanded from macro 'IMPL_COLL_FUNC'f (T)) { 391| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ R| u group(groupn Work, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, Func# #687d | e v r e d o p < t y pper>i,m sN(CtCiLd_-AtLiGdOS_t#a#ratlBgcoa,s tN,C CnLT_hPrReOaTdOs_B#c#apsrto,t o&>d(i)r.ercutn-(>&onuctc,l Snhumlelmp.twro,r ka)r;g s\- > s| e ^n dbuff, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:g562s:-15>:r enote: cfield 'nthreads' will be initialized after field 'tidInBlock'v buff, | ^562 | tid(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d202):,53 :n tnote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer eads (202n | t h r e a d s ) ,R utniWdoIrnkBElloecmke(ntth ( )| . 
^~~~~~~~~~~~~~~~~r un(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hw:e562):;60 : | note: ^field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp562: | 8 : 1 : tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested hered (tid )8, | InMtPhLr_eCaOdLsL(_nFtUhNrCe(aAdlsl)R,e dtuicdeI,n BClOoLcLkN(EtTh_rDeIaRdEICdTx,. xS)I,M PgLrEo,u pP(rgerMouulpS)u,m , | i ^~~~~~~~~~~n t64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, 
&direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, 
&direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hp:)562,: 15 :| ^~~~~~~~~~~~~~~~~warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60 :562 | note: field 'group' will be initialized after field 'stepSize' tid (562t | i d ) , tnitdh(rteiadd)s,( nntthhrreeaaddss)(,n tthirdeIandBsl)o,c kt(itdhIrneBaldoIcdkx(.txh)r,e agdrIoduxp.(xg)r,o ugpr)o,u p (| g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o u| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) , | ^~~~~~~~~~~563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15:: 562warning: :initializer order does not match the declaration order [-Wreorder-ctor]15 : warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid (562t | i d ) , tnitdh(rteiadd)s, (nntthhrreeaaddss()n,t htriedaIdnsB)l,o ctki(dtIhnrBelaodcIkd(xt.hxr)e,a dgIrdoxu.px()g,r ogurpo)u,p ( g| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~o u p| ) tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ 563| | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) s t563e | p S i z es(tnecpcSliSzhem(enmc.ccloSmhmm.ebmu.fcfoSmimz.ebsu[fNfCSCiLz_ePsR[ONTCOC_LS_IPMRPOLTEO]_/SNICMCPLL_ES]T/ENPCSC/Ls_iSzTeEoPfS(/Ts)i)z e{o f (| T ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~) ) | { group(group | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :655677 | : 11 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here p r677i | m s ( t i d - t i d SptrairmtsR(etdiudc-et,i dnSTtharretaBdcsaRsetd,u cneT,h rneualdlspBtcra,s t&,d i&rdeicrte-c>to-u>to,u ta,r gdsi-r>escetn-d>bduofwfn,, aarrggss-->>rseecnvdbbuuffff,, a| r ^g s->recvb/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hu:f202f:,53 : | note: ^in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:5315:: note: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested herefield 'nthreads' will be initialized after field 'tidInBlock' 562 | 202 | t i d ( t i dR)u,n 
WnotrhkrEelaedmse(nnttI(d)x..rxu)n,( wger)o;u p (| g ^r oup), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::9562::160:: note: note: in instantiation of member function 'RunWork, 2, 2>::run' requested herefield 'group' will be initialized after field 'stepSize' 9 | 562I | M P L _ CtOiLdL(_tFiUdN)C,( AnltlhRreedaudcse(,n tChOrLeLaNdEsT)_,D ItRiEdCITn,B lSoIcMkP(LtEh,r ePardeIMduxl.Sxu)m,, gurionutp6(4g_rto)u p )| ,^ | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15 : 
note: field 'nthreads' will be initialized after field 'tidInBlock'R unWorkE l562e | m e n t s()),. rtuind(IwneB)l;o c k| ( ^t hreadIdx.x), g/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppr:o8u:p1(:g rnote: oin instantiation of member function 'RunWork, 2, 2>::run' requested hereu p), 8| | ^~~~~~~~~~~~~~~~~I MPL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:O562L:L60_:F Unote: Nfield 'group' will be initialized after field 'stepSize'C (AllRed u562c | e , C OtLiLdN(EtTi_dD)I,R EnCtTh,r eSaIdMsP(LnEt,h rPeraedMsu)l,S utmi,d IinnBtl6o4c_kt()t h r| e^a dIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hx:)391,: 95g:r onote: uexpanded from macro 'IMPL_COLL_FUNC'p (grou p391) | , R| u ^~~~~~~~~~~n Work, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &,direc tn-t>horueta,d sd(inrtehcrte-a>ddso)w,n ,t iadrIgnsB-l>oscekn(dtbhurfefa,d Iadrxg.sx-)>,r egcrvobuupf(fg,r o u| p ^) , | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 562:60: note: 202field 'group' will be initialized after field 'stepSize' | 562 | R u n W otrikdE(lteimde)n,t l(o)c.kr(utnh(rweea)d;I d x| . ^x ), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp(:g8r:o1u:p )note: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here | ^~~~~~~~~~~ 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-:t95i:d Snote: texpanded from macro 'IMPL_COLL_FUNC'a rtScatter, 391n | T h rReuandWsoSrckaeu,p ,F uanrcg#s#-d>esverneddboupfg,s -N>CrCeLc_vAbLuGfOf_,# # a| l ^g o, NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hL:_202P:R53O:T Onote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here# #pro t202o | > ( ) . r u n ( &RnucncWloSrhkmEelme.mweonrtk<)F;n ,\ T ,| ^R edOp, Algo,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :P562r:o15t:o >note: (field 'nthreads' will be initialized after field 'tidInBlock') .run(we )562; | | ^ tid(ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppd:)9,: 1n:t hnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree ads( n9t | hIrMePaLd_sC)O,L Lt_iFdUINnCB(lAolclkR(etdhurceea,d ICdOxL.LxN)E,T _gDrIoRuEpC(Tg,r oSuIpM)P,L E ,| ^~~~~~~~~~~~~~~~~P reMu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:S562u:m60,: unote: ifield 'group' will be initialized after field 'stepSize'n t64_t )562 | | ^ tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:t391i:d95):, note: nexpanded from macro 'IMPL_COLL_FUNC't hreads (391n | t h rReuandWso)r,k , NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##al/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:g562o:,15 :N Cwarning: Cinitializer order does not match the declaration order [-Wreorder-ctor]L _PROTO_## p562r | o t o > (t)i.dr(utni(d&)n,c cnltShhrmeeamd.sw(onrtkh)r;e a\d s )| , ^ tidInBloc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hk:(562t:h15r:e anote: dfield 'nthreads' will be initialized after field 'tidInBlock'I dx.x), 562g | r o u p (tgirdo(utpi)d,) , | n ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~t h r| e tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)a ds(nt h563r | e a d s )s,t etpiSdiIzneB(lnoccckl(Sthhmreema.dcIodmxm..xb)u,f fgSriozueps([gNrCoCuLp_)P,R O T| O ^~~~~~~~~~~~~~~~~_ SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:P562L:E60]:/ Nnote: Cfield 'group' will be initialized after field 'stepSize'C L_STE P562S | / s i z etoifd((Tt)i)d ){, n| t ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h r e| a group(groupd s(nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:)655,: 11t:i dnote: Iin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heren Block( t655h | r e a d I d x . 
x ) ,p rgirmosu(pt(igdr-otuipd)S,t a r| t ^~~~~~~~~~~R educe, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ lFunc##func, type, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ t->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member 
function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 
'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, 
uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:o562f:(15T:) )warning: initializer order does not match the declaration order [-Wreorder-ctor]{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 562 group(group | tid(tid), nthreads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r666e:a9d:s )note: ,in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here tidInB l666o | c k ( t h r e a dpIrdixm.sx()t,i dg,r onuTph(rgeraoduspG)a,t h e| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, d| i tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r ect-> u563p | , N U LsLt,e paSrigzse-(>nscecnldSbhumfefm,. caormgms.-b>urfefcSvibzuefsf[,N C C| L ^_ PROTO_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hI:M202P:L53E:] /note: Nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereC CL_S T202E | P S / s i z e o fR(uTn)W)o r{k E l| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m e n| t group(group< Fn, T, RedOp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h, :A666l:g9o:, note: Pin instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herer oto>() .666r | u n( we ) ; | ^p rims(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppt:i10d:,1 :n Tnote: hin instantiation of member function 'RunWork, 2, 2>::run' requested herer ead s10G | aItMhPeLr_,C OdLiLr_eFcUtN-C>(uApl,l RNeUdLuLc,e ,a rCgOsL-L>NsEeTn_dDbIuRfEfC,T ,a rSgIsM-P>LrEe,c vPbruefMfu,l S u| m ^, half) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :| 202^: 53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 391:95: 202note: | expanded from macro 'IMPL_COLL_FUNC' 391 | R uRnuWnoWrokrEklemed(e)v.rreudno(pw| , ^ NCCL_A/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppL:G10O:_1#:# anote: lin instantiation of member function 'RunWork, 2, 2>::run' requested hereg o, N C10C | LI_MPPRLO_TCOO_L#L#_pFrUoNtCo(>A(l)l.Rreudnu(c&en,c cClOSLhLmNeEmT._wDoIrRkE)C;T ,\ S I| M ^P LE, 
P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:e562M:u15l:S unote: mfield 'nthreads' will be initialized after field 'tidInBlock', half) 562 | | ^ ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:(391t:i95d:) ,note: expanded from macro 'IMPL_COLL_FUNC'n threa d391s | ( n tRhurneWaodrsk)<,n ctcildFIunnBcl#o#cfku(ntch,r etaydpIed,x .Fxu)n,c #g#rdoeuvpr(egdroopu| , ^~~~~~~~~~~~~~~~~ NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562A:L60G:O _note: #field 'group' will be initialized after field 'stepSize'# algo ,562 | N C C L _tPiRdO(TtOi_d#)#,p rnotthor>e(a)d.sr(unnt(h&rnecacdlsS)h,m etmi.dwIonrBkl)o;c k\( t h| r ^e adIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:x562):,15 :g rnote: ofield 'nthreads' will be initialized after field 'tidInBlock'u p(gro u562p | ) , | t ^~~~~~~~~~~i d(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads),
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, 
NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 
0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElemeantd(x)..xr)u,n (gwreo)u;p ( g| r ^o up), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :| 9 ^~~~~~~~~~~: 1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcas/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:,562 :n15T:h rwarning: einitializer order does not match the declaration order [-Wreorder-ctor]a dsBcast ,562 | & d i r etcitd-(>toiudt),, dnitrherceta-d>sd(onwtnh,r eaardgss)-,> steinddIbnuBflfo,c ka(rtghsr-e>ardeIcdvxb.uxf)f,, g r| o ^u p(group),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~53 : | note: tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 563 | 202 | s t e p S iRzuen(WnocrcklESlhemmeemn.tcI(M)P.LrEu]n/(NwCeC)L;_ S T| E ^P 
S/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp(:T10):)1 :{ note: in instantiation of member function 'RunWork, 2, 2>::run' requested here| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 10 | IMPL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hC:O626L:L9_:F Unote: Nin instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereC (AllR e626d | u c e , C O L LpNrEiTm_sD(ItRiEdC-Tt,i dSSItMaPrLtES,c aPtrteeMr,u lnSTuhmr,e ahdaslSfc)a t t| e^r , NULL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391d:i95r:e cnote: texpanded from macro 'IMPL_COLL_FUNC'- >up, ar g391s | - > sReunndWbourfkf<,n cacrlgFsu-n>cr#e#cfvubnucf,f ,t y p| e ^, Fun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:#202#:d53e:v rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered op< t202y | p e > , N C C LR_uAnLWGoOr_k#E#laelmgeon,t ().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ n, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), 
| ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' 
requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, 
args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member 
function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, 
nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | 
prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h562::56215::15 :warning: initializer order does not match the declaration order [-Wreorder-ctor]warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 562 | t i dt(itdi(dt)i,d )n,t hnrtehardesa(dnst(hnrtehardesa)d,s )t,i dtIindBIlnoBclko(ctkh(rtehardeIaddxI.dxx).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 563 | s t esptSeipzSei(znec(cnlcSchlmSehmm.ecmo.mcmo.mbmu.fbfuSfifzSeisz[eNsC[CNLC_CPLR_OPTROO_TSOI_MSPILMEP]L/EN]C/CNLC_CSLT_ESPTSE/PsSi/zseiozfe(oTf)()T ){) {| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~| group(group | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h9::677 :note: 11in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | 677 | p r i m sp(rtiimds-(ttiiddS-ttairdtSStcaartttBecra,s tn,T hnrTehardesaSdcsaBtctaesrt,, N&UdLiLr,e cdti-r>eocutt-,> udpi,r eacrtg-s>-d>oswenn,d baurfgfs,- >asregnsd-b>urfefc,v baurfgfs,- > r| e ^c 
vbuff,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202| : ^53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202202 | : 53 : note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here Ru n202W | o r k E l e m e nRtu (A)l.grou,n (Pwreo)t;o > (| ) ^. run(we)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp;: 11 :| 1 ^: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp: 1011: | 1I:M Pnote: Lin instantiation of member function 'RunWork, 2, 2>::run' requested here_ COLL _10F | UINMCP(LA_lClORLeLd_uFcUeN,C (CAOlLlLRNeEdTu_cDeI,R ECCOTL,L NSEITM_PDLIER,E CPTr,e MSuIlMSPuLmE,, fPlroeaMtu)l S u| m^, hal/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:)391 : 95| :^ note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391: 95391: | note: expanded from macro 'IMPL_COLL_FUNC'R unWork <391n | c c lRFuunnWco#r#kf, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ e, Func##devredop, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562562 | : 15 : warning: tinitializer 
order does not match the declaration order [-Wreorder-ctor]i d(tid), 562n | t h r e atdisd((nttihdr)e,a dnst)h,r etaiddsI(nnBtlhorceka(dtsh)r,e atdiIddIxn.Bxl)o,c kg(rtohurpe(agdrIoduxp.)x,) , | g ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o u| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)( grou p563) | , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~s t e| p tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S ize( n563c | c l S h msetme.pcSoimzme.(bnucfcflSSihzmeesm[.NcCoCmLm_.PbRuOfTfOS_iSzIeMsP[LNEC]C/LN_CPCRLO_TSOT_ESPISM/PsLiEz]e/oNfC(CTL)_)S T{E P S| / ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s i z| e group(groupo f(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: :in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here677 :11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | 677 | p r i m s (ptriidm-st(itdiSdt-atritdRSetdaurcteB,c ansTth,r enaTdhsrReeadduscBec,a sntu,l l&pdtirr,e c&td-i>roeuctt,- >doiurte,c ta-r>gdso-w>ns,e nadrbgusf-f>,s eanrdgbsu-f>fr,e cavrbgusf-f>,r e c| v ^b uff, | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 202:53: 202note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | R u n W o r k ERluenmWeonrtk,( 
)P.rroutno(>w(e)).;r u n| ( ^w e); | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :11:1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:: 11note: :in instantiation of member function 'RunWork, 2, 2>::run' requested here1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | I M11P | LI_MCPOLL_LC_OFLULN_CF(UANlCl(RAeldluRceed,u cCeO,L LCNOELTL_NDEITR_EDCITR,E CSTI,M PSLIEM,P LPEr,e MPurleSMuuml,S ufml,o aftl)o a t| )^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::95391:: 95note: :expanded from macro 'IMPL_COLL_FUNC' note: expanded from macro 'IMPL_COLL_FUNC' 391 | 391 | R uRnuWnoWrokre,> ,N CNCCLC_LA_LAGLOG_O#_##a#laglog,o ,N CNCCLC_LP_RPORTOOT_O#_##p#rportoot>o(>)(.)r.urnu(n&(n&cncclcSlhSmhemme.mw.owrokr)k;) ;\ \ | ^| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::1515:: note: note: field 'nthreads' will be initialized after field 'tidInBlock'field 'nthreads' will be initialized after field 'tidInBlock' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::562562::6060:: note: note: field 'group' will be initialized after field 'stepSize'field 'group' will be initialized after field 'stepSize' 562562 | | ttiidd((ttiidd)),, nntthhrreeaaddss((nntthhrreeaaddss)),, 
ttiiddIInnBBlloocckk((tthhrreeaaddIIddxx..xx)),, ggrroouupp((ggrroouupp)),, | | ^~~~~~~~~~~ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h15: warning: :initializer order does not match the declaration order [-Wreorder-ctor]562 :15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | t562i | d ( t i dt)i,d (nttihdr)e,a dnst(hnrtehardesa(dnst)h,r etaiddsI)n,B ltoicdkI(ntBhlroecakd(Itdhxr.exa)d,I dgxr.oxu)p,( ggrroouupp)(,g r o| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~p ) ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)563 | 563s | t e p S isztee(pnSciczleS(hnmcecml.Schommemm..bcuofmfmS.ibzuefsf[SNiCzCeLs_[PNRCOCTLO__PSRIOMTPOL_ES]I/MNPCLCEL]_/SNTCECPLS_/SsTiEzPeSo/fs(iTz)e)o f{( T )| ) ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ { | group(group| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hnote: :in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here687 :11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | 687 | p r i m s (ptriidm-st(itdiSdt-atritdRSetdaurcteB,c ansTth,r enaTdhsrReeadduscBec,a sntu,l l&pdtirr,e c&td-i>roeuctt,- >nouultl,p tarr,g sa-r>gsse-n>dsbeunfdfb,u fafr,g sa-r>grse-c>vrbeucfvfb,u f f| , ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h53::202 :note: 53in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 202 | R u n WRournkWEolrekmEelnetmo(t)o.>r(u)n.(rwuen)(;w e )| ; ^ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppnote: :in instantiation of member function 'RunWork, 2, 2>::run' requested here8 :1: note: 11in instantiation of member function 'RunWork, 2, 2>::run' requested here | IMPL_C O8L | LI_MFPULN_CC(OALlLl_RFeUdNuCc(eA,l lCROeLdLuNcEeT,_ DCIORLELCNTE,T _SDIIMRPELCET,, PSrIeMMPuLlES,u mP,r efMluolaStu)m , | i^n t64_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:)391 : 95| :^ note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391 :39195 | : note: Rexpanded from macro 'IMPL_COLL_FUNC'u nWork< n391c | c l FRuunncW#o#rfkun,c #N#CdCeLv_rAeLdGoOp_<#t#yapleg>o,, NNCCCCLL__APLRGOOT_O#_##a#lpgroo,t oN>C(C)L._rPuRnO(T&On_c#c#lpSrhomteom>.(w)o.rrku)n;( &\n c c| l ^S hmem.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:k562):;15 :\ note: field 
'nthreads' will be initialized after field 'tidInBlock'| ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15t:i dnote: (field 'nthreads' will be initialized after field 'tidInBlock't id), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~~~~~~~g roup/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:g562r:o60u:p )note: ,field 'group' will be initialized after field 'stepSize' | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 60t:i dnote: (field 'group' will be initialized after field 'stepSize't id), n562t | h r e a dtsi(dn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~g roup(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, 
args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in 
instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, Pre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hM:u562l:S15u:m ,warning: initializer order does not match the declaration order [-Wreorder-ctor]f loat) | 562^ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i391d:(95t:i dnote: )expanded from macro 'IMPL_COLL_FUNC', nthrea d391s | ( n tRhurneWaodrsk)<,n ctcildFIunnBcl#o#cfku(ntch,r etaydpIed,x .Fxu)n,c #g#rdoeuvpr(egdroopu| , ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ N C| C tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L _ALG O563_ | # # a l gsot,e pNSCiCzLe_(PnRcOcTlOS_h#m#epmr.octoom>m(.)b.urfufnS(i&znecsc[lNSChCmLe_mP.RwOoTrOk_)S;I M\P L E| ] ^/ NCCL_S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hT:E562P:S15/:s inote: zfield 'nthreads' will be initialized after field 'tidInBlock'e of(T)) 562{ | | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t i| d group(group( tid), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hh:r666e:a9d:s (note: nin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heret hrea d666s | ) , t i d I n Bplroicmks((tthirde,a dnITdhxr.exa)d,s Ggartohuepr(,g rdoiurpe)c,t - >| u ^~~~~~~~~~~~~~~~~p , NU/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:L562,: 60a:r gnote: sfield 'group' will be initialized after field 'stepSize'- >sendb u562f | f , a rtgisd-(>triedc)v,b unftfh,r e a| d ^s (nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d202s:)53,: tnote: iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered InBl o202c | k ( t h r e a d IRduxn.Wxo)r,k Eglreomuepn(tg().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.houp(g:r562o:u15p:) ,warning: initializer order does not match the declaration order [-Wreorder-ctor] | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | s562t | e p S i ztei(dn(ctcildS)h,m enmt.hcroemamd.sb(unftfhSriezaedss[)N,C CtLi_dPIRnOBTlOo_cSkI(MtPhLrEe]a/dNICdCxL._xS)T,E PgSr/osuipz(egorfo(uTp))), { | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)| group(group 563 | st/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:p677S:i11z:e (note: nin instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec clShme m677. | c o m m . 
b u f f S ipzreism[sN(CtCiLd_-PtRiOdTSOt_aSrItMBPcLaEs]t/,N CnCTLh_rSeTaEdPsSB/csaiszte,o f&(dTi)r)e c{t - >| o ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~u t ,| group(groupd irect->down, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:-641>:s11e:n dnote: bin instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu ff, ar g641s | - > r e c v b u f f ,p r i| m ^s (tid-tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hS:t202a:r53t:R enote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu ce, n202T | h r e a d s R e dRuucneW,o rdkiErleecmte-n>td oAultg,o ,a rPgrso-t>os>e(n)d.bruufnf(,w ea)r;g s -| > ^r ecvbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp,: 10 :| 1 ^: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :10202 | :I53M:P Lnote: _in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereC OLL_ F202U | N C ( A l l R e dRuucneW,o rCkOELlLeNmEeTn_tDh(a)l.fr)u n (| w^e ); /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h| : ^391 :95: note: expanded from macro 'IMPL_COLL_FUNC' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10: 1391: | note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereR unWor k10< | nIcMcPlLF_uCnOcL#L#_fFuUnNcC,( AtlylpRee,d uFcuen,c #C#OdLeLvNrEeTd_oDpI ,S INMCPCLLE_,A LPGrOe_M#u#laSlugmo,, hNaClCfL)_ P R| O^T O_##p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:o391t:o95>:( )note: .expanded from macro 'IMPL_COLL_FUNC'r 
un(&ncc l391S | h m eRmu.nwWoorrkk)<;n c\c l F| u ^n c##fu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:c562,: 15t:y pnote: efield 'nthreads' will be initialized after field 'tidInBlock', Func #562# | d e v r etdiodp( ,n tNhCrCeLa_dAsL(GnOt_h#r#eaaldgso),, NtCiCdLI_nPBRlOoTcOk_(#t#hprreoatdoI>d(x)..xr)u,n (g&rnocucpl(Sghrmoeump.)w,o r k| ) ^~~~~~~~~~~~~~~~~; \ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h| : ^562 :60: note: field 'group' will be initialized after field 'stepSize'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: 562note: | field 'nthreads' will be initialized after field 'tidInBlock' ti d562( | t i d ) ,t indt(htrieda)d,s (nntthhrreeaaddss()n,t htriedaIdnsB)l,o ctki(dtIhnrBelaodcIkd(xt.hxr)e,a dgIrdoxu.px()g,r ogurpo)u,p ( g| r ^~~~~~~~~~~o up), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 
0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 
'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hze(ncclShmem.comm.buffSi/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hzes[:N562C:C15L:_ Pwarning: Rinitializer order does not match the declaration order [-Wreorder-ctor]O 
TO_SIMPLE]/N C562C | L _ S T EtPiSd/(stiizde)o,f (nTt)h)r e{a d s| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n t h| r group(groupe ads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:I687n:B11l:o cnote: kin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( thread I687d | x . x ) , g r o u pp(rgirmosu(pt)i,d - t| i ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d S t| a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r tBcas t563, | n T h rsetaedpsSBiczaes(tn,c c&ldSihrmeecmt.-c>oomumt.,b unfuflSlipzters,[ NaCrCgLs_-P>RsOeTnOd_bSuIfMfP,L Ea]r/gNsC-C>Lr_eScTvEbPuSf/fs,i z e| o ^f (T)) {/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~53 : | note: group(groupin instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hu:n687W:o11r:k Enote: lin instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree mentt(i)d.Srtuanr(twBec)a;s t ,| ^n Threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppd:s11B:c1a:s tnote: ,in instantiation of member function 'RunWork, 2, 2>::run' requested here &dir e11c | tI-M>PoLu_tC,O LnLu_lFlUpNtCr(,A lalrRgesd-u>csee,n dCbOuLfLfN,E Ta_rDgIsR-E>CrTe,c vSbIuMfPfL,E , | P ^r eMulSum, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:l202o:a53t:) note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here| ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h202: | 391 : 95 : note: expanded from macro 'IMPL_COLL_FUNC' RunW o391r | k E lReumneWnotrn(c)#.#rduenv(rweed)o;p < t| y ^p e>, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppA:L10G:O1_:# #note: ain instantiation of member function 'RunWork, 2, 2>::run' requested herel go, N C10C | LI_MPPRLO_TCOO_L#L#_pFrUoNtCo(>A(l)l.Rreudnu(c&en,c cClOSLhLmNeEmT._wDoIrRkE)C;T ,\ S I| M ^P LE,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :P562r:e15M:u lnote: Sfield 'nthreads' will be initialized after field 'tidInBlock'u m, ha l562f | ) | ^t id(t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hi:d391):,95 :n tnote: hexpanded from macro 'IMPL_COLL_FUNC'r eads(nt h391r | e a dRsu)n,W otrikd/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562N:C60C:L _note: Afield 'group' will be initialized after field 'stepSize'L GO_## a562l | g o , NtCiCdL(_tPiRdO)T,O _n#t#hprreoatdos>((n)t.hrruena(d&sn)c,c ltSihdmIenmB.lwoocrkk()t;h r\e a d| I ^d x.x), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hg:r562o:u15p:( gnote: rfield 'nthreads' will be initialized after field 'tidInBlock'o up), | 562 ^~~~~~~~~~~ | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:T562h:r15e:a dwarning: sinitializer order does not match the declaration order [-Wreorder-ctor]S catter, N U562L | L , d itriedc(tt-i>du)p,, natrhgrse-a>dsse(nndtbhurfefa,d sa)r,g st-i>drIencBvlboucfkf(,t h r| e ^a dIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hx:)202,: 53g:r onote: uin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herep (gro u202p | ) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)u nWork E563l | e m e n tsf(S)i.zreusn[(NwCeC)L;_ P R| O ^T O_SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppE:]12/:N1C:C Lnote: _in instantiation of member function 'RunWork, 2, 2>::run' requested hereS TEPS /12s | iIzMePoLf_(CTO)L)L _{F U N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( A l| l group(groupR educe, C/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hO:L666L:N9E:T _note: Din instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested 
hereI RECT, SIM:PLE, 562P:r15e:M uwarning: linitializer order does not match the declaration order [-Wreorder-ctor]S um, double) | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391 :t95i:d (note: texpanded from macro 'IMPL_COLL_FUNC'i d), nthr e391a | d s (RnutnhWroerakd , | N ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~C C L| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)A LGO_ #563# | a l g o ,s tNeCpCSLi_zPeR(OnTcOc_l#S#hpmreomt.oc>o(m)m..rbuunf(f&SniczcelsS[hNmCeCmL._wPoRrOkT)O;_ S\I M P| L ^E ]/NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hS:T562E:P15S:/ snote: ifield 'nthreads' will be initialized after field 'tidInBlock'z eof(T)) {562 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ t| i group(groupd (tid), nthread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hs:(666n:t9h:r enote: ain instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hered s), t666i | d I n B l o c k (ptrhirmesa(dtIiddx,. 
xn)T,h rgeraoduspG(agtrhoeurp,) ,d i r| e ^~~~~~~~~~~~~~~~~c t->up/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562N:UL60L:, note: afield 'group' will be initialized after field 'stepSize'r gs->sen d562b | u f f , tairdg(st-i>dr)e,c vnbtuhfrfe,a d s| ( ^n thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hs:)202,: 53t:i dnote: Iin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heren Bloc k202( | t h r e a d I d xR.uxn)W,o rgkrEoluepm(egnrto().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in 
instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, 
nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, 
SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:N562E:T15_:D Iwarning: Rinitializer order does not match the declaration order [-Wreorder-ctor]E CT, SIMPLE, 562P | r e M u ltSiudm(,t ifdl)o,a tn)t h r| e^a ds(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h391r:e95a:d snote: )expanded from macro 'IMPL_COLL_FUNC', tidI n391B | l o cRku(ntWhorreka ,s tNeCpCSLi_zAeL(GnOc_c#l#Sahlmgeom,. cNoCmCmL._bPuRfOfTSOi_z#e#sp[rNoCtCoL>_(P)R.OrTuOn_(S&InMcPcLlES]h/mNeCmC.Lw_oSrTkE)P;S /\s i z| e ^o f(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :{562 : 15| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: field 'nthreads' will be initialized after field 'tidInBlock'| group(group 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:i655d:(11t:i dnote: )in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, nthre a655d | s ( n t h r e a d s )p,r itmisd(ItniBdl-otcikd(SttharretaRdeIdduxc.ex,) ,n Tghrroeuapd(sgRreoduupc)e,, n| u ^~~~~~~~~~~~~~~~~l lptr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h,: 562&:d60i:r enote: cfield 'group' will be initialized after field 'stepSize't ->out ,562 | a r g s -t>isde(ntdibdu)f,f ,n tahrrgesa-d>sr(enctvhbruefafd,s ) ,| ^t idInBl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:c202k:(53t:h rnote: ein instantiation of member function 'RunWorkElement, 2, 2>::run' requested herea dIdx .202x | ) , g r o u p 
(RgurnoWuopr)k,E l e| m ^~~~~~~~~~~e nt().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562P:R15O:T Owarning: _initializer order does not match the declaration order [-Wreorder-ctor]S IMPLE]/NC C562L | _ S T E PtSi/ds(itziedo)f,( Tn)t)h r{e a d| s ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~( n t| h group(groupr eads), 
ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hd:I666n:B9l:o cnote: kin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( threa d666I | d x . x ) , g rporuipm(sg(rtoiudp,) ,n T h| r ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~e a d| s tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)G ather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h-:>562s:e15n:d bwarning: uinitializer order does not match the declaration order [-Wreorder-ctor]f f, args-> r562e | c v b u ftfi,d ( t| i ^d ), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:s (note: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret hrea d202s | ) , t i d I n BRluoncWko(rtkhErleeamdeIndtx<.Fxn),, Tg,r oRuepd(Ogpr,o uApl)g,o , | P ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r o t| o tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)> ().ru n563( | w e ) ; s t| e ^p Size(nc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppc:l10S:h1m:e mnote: .in instantiation of member function 'RunWork, 2, 2>::run' requested herec omm. 
b10u | fIfMSPiLz_eCsO[LNLC_CFLU_NPCR(OATlOl_RSeIdMuPcLeE,] /CNOCLCLLN_ESTT_EDPISR/EsCiTz,e oSfI(MTP)L)E ,{ P r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~M u l| S group(groupu m, half/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h): 641 :| 11^: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: 641note: | expanded from macro 'IMPL_COLL_FUNC' 391 | p rRiumnsW(otrikd<-ntcicdlSFtuanrct#R#efduuncce,, tnyTpher,e aFdusnRce#d#udceev,r eddiorpeed>o,w nN,C C&Ld_iArLeGcOt_-#>#oaultg,o ,a rNgCsC-L>_sPeRnOdTbOu_f#f#,p raortgos>-(>)r.ercuvnb(u&fnfc,c l S| h ^m em.wor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hk:)202;: 53\: note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h202: | 562 : 15 : note: field 'nthreads' will be initialized after field 'tidInBlock' RunW o562r | k E l e mteindt(t(i)d.IrnuBnl(owcek)(;t h r| e ^a dIdx./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppx:)12,: 1g:r onote: uin instantiation of member function 'RunWork, 2, 2>::run' requested herep (grou p12) | ,I M P| L ^~~~~~~~~~~~~~~~~_ CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:L562_:F60U:N Cnote: (field 'group' will be initialized after field 'stepSize'A llRed u562c | e , C OtLiLdN(EtTi_dD)I,R EnCtTh,r eSaIdMsP(LnEt,h rPeraedMsu)l,S utmi,d IdnoBulbolcek)( t h| r^e adI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hd:x391.:x95):, note: gexpanded from macro 'IMPL_COLL_FUNC'r oup(gr o391u | p ) ,R u n| W ^~~~~~~~~~~o rk, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Sizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreadsI(nthreMaPdLsE),, PtriedMIunlBSluomc,k (dtohurbelaed)I d x| .^x ), group/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h(:g391r:o95u:p )note: ,expanded from macro 'IMPL_COLL_FUNC' | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :391562 | : 60 :R unote: nfield 'group' will be initialized after field 'stepSize'W ork< n562c | c l F u ntci#d#(ftuindc),, tnytpher,e aFdusn(cn#t#hdreevardesd)o,p B,l oNcCkC(Lt_hArLeGaOd_I#d#xa.lxg)o,, gNrCoCuLp_(PgRrOoTuOp_)#,# p r| o ^~~~~~~~~~~t o>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c15k:( twarning: hinitializer order does not match the declaration order [-Wreorder-ctor]r eadIdx.x )562, | g r o utpi(dg(rtoiudp)),, n t| h ^~~~~~~~~~~r eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 562 | 202 | t i d ( t i d )R,u nnWtohrrkeEaldesm(enntthd(x)..xr)u,n (gwreo)u;p ( g| r ^o up), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp| : ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~12 : 1| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 563 | 12 | I M PsLt_eCpOSLiLz_eF(UnNcCc(lASlhlmReemd.uccoem,m .CbOuLfLfNSEiTz_eDsI[RNECCCTL,_ PSRIOMTPOL_ES,I MPPrLeEM]u/lNSCuCmL,_ SdToEuPbSl/es)i z e| o^f 
(T))/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :{391 : 95| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ note: expanded from macro 'IMPL_COLL_FUNC'| group(group 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h :R666u:n9W:o rnote: kin instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here< ncclF u666n | c # # f u n c , ptryipmes,( tFiudn,c #n#TdherveraeddsoGpa ,d iNrCeCcLt_-A>LuGpO,_ #N#UaLlLg,o ,a rNgCsC-L>_sPeRnOdTbOu_f#f#,p raortgos>-(>)r.ercuvnb(u&fnfc,c l S| h ^m em.wo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:k202):;53 :\ note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here| ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : note: Rfield 'nthreads' will be initialized after field 'tidInBlock'u nWork E562l | e m e n tt)(,) .triudnI(nwBel)o;c k (| t ^h read/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppI:d12x:.1x:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereg roup (group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562(:g15r:o uwarning: pinitializer order does not match the declaration order [-Wreorder-ctor]) , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)562 | t563i | d ( t i ds)t,e pnStihzree(andcsc(lnSthhmreema.dcso)m,m .tbiudfIfnSBilzoecsk[(NtChCrLe_aPdRIOdTxO._xS)I,M PgLrEo]u/pN(CgCrLo_uSpT)E,P S /| s ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~i z e| o tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)f (T)) {563 | | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ s| t group(groupe pSize(nccl/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hS:h655m:e11m:. cnote: oin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herem m.buf f655S | i z e s [ N C C L _ PpRrOiTmOs_(StIiMdP-LtEi]d/SNtCaCrLt_RSeTdEuPcSe/,s inzTehorfe(aTd)s)R e{d u c| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~, n| u group(groupl lptr, &d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:r655e:c11t:- >note: oin instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereu t, ar g655s | - > s e n d b u f f ,p rairmgss(-t>irde-ctvibduSftfa,r t R| e ^d uce, nTh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hr:e202a:d53s:R enote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereu ce, n202u | l l p t r , & dRiurneWcotr-k>Eoluetm,e natr sTe,n dRbeudfOfp,, aArlggso-,> rPerotcov>b(u)f.fr,u n (| w ^e ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp :note: 11in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here: 1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here202 | 11 | IMPL_ C O L L _RFuUnNWCo(rAklEllReemdeuncte<,F nC,O LTL,N ERTe_dDOIpR,E CATl,g oS,I MPPrLoEt,o >P(r)e.MruulnS(uwme,) ;f l o| a ^t ) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::1391:: 95note: :in instantiation of member function 'RunWork, 2, 2>::run' requested here note: expanded from macro 'IMPL_COLL_FUNC' 11 | I M391P | L _ CROuLnLW_oFrUkNl,S uNmC,C Lf_lAoLaGtO)_ # #| a^l go, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C391C:L95_:P Rnote: Oexpanded from macro 'IMPL_COLL_FUNC'T O_##pr o391t | o > (R)u.nrWuonr(k&562, | N C C Lt_iAdL(GtOi_d#)#,a lngtoh,r eNaCdCsL(_nPtRhOrTeOa_d#s#)p,r ottiod>I(n)B.lroucnk((&tnhcrcelaSdhImdexm..xw)o,r kg)r;o u\p ( g| r ^o up),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^~~~~~~~~~~~~~~~~15 : note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hfield 'nthreads' will be initialized after field 'tidInBlock': 562:60: note: 562field 'group' will be initialized after field 'stepSize' | t i562d | ( t i d )t,i dn(tthirde)a,d sn(tnhtrheraedasd(sn)t,h rteiaddIsn)B,l otcikd(ItnhBrleoacdkI(dtxh.rxe)a,d Igdrxo.uxp)(,g rgoruopu)p,( g r| o ^~~~~~~~~~~~~~~~~u p),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : ^~~~~~~~~~~60 : note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 
1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElementt(i)d.(rtuind()w,e )n;t h r| e ^a 
ds(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppe:a13d:s1):, note: tin instantiation of member function 'RunWork, 2, 2>::run' requested herei dInB l13o | cIkM(PtLh_rCeOaLdLI_dFxU.NxC)(,A lglrRoeudpu(cger,o uCpO)L,L N E| T ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~_ D I| R tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)E CT, S I563M | P L E , sPtreepMSuilzSeu(mn,c crlcSchlm_ebmf.lcooamtm1.6b)u f f| S^i zes[/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hN:C391C:L95_:P Rnote: Oexpanded from macro 'IMPL_COLL_FUNC'T O_SIMP L391E | ] / NRCuCnLW_oSrTkEin instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here, NCC L666_ | A L G O _ # # a lgpor,i mNsC(CtLi_dP,R OnTTOh_r#e#apdrsoGtaot>h(e)r.,r udni(r&encctc-l>Suhpm,e mN.UwLoLr,k )a;r g\s - >| s ^e ndbuf/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hf:,562 :a15r:g snote: -field 'nthreads' will be initialized after field 'tidInBlock'> recvb u562f | f , | t ^i d(tid),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :n202t:h53r:e anote: din instantiation of member function 'RunWorkElement, 2, 2>::run' requested heres (nth r202e | a d s ) , t i dRIunnBWloorckkE(ltehmreenatd()/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h.:r562u:n60(:w enote: )field 'group' will be initialized after field 'stepSize'; | ^ 562 | tid(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppt:i10d:)1,: nnote: tin instantiation of member function 'RunWork, 2, 2>::run' requested hereh reads( n10t | hIrMePaLd_sC)O,L Lt_iFdUINnCB(lAolclkR(etdhurceea,d ICdOxL.LxN)E,T _gDrIoRuEpC(Tg,r 
oSuIpM)P,L E ,| ^~~~~~~~~~~P reMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement (t)i.dr(utni(dw)e,) ;n t h| r ^e ads(nthreads), t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppi:d13I:n1B:l onote: cin instantiation of member function 'RunWork, 2, 2>::run' requested herek (thre a13d | IIdMxP.Lx_)C,O LgLr_oFuUpN(Cg(rAolulpR)e,d u c| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, C| O tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)L LNET _563D | I R E C Ts,t eSpISMiPzLeE(,n cPcrleSMhumleSmu.mc,o mrmc.cblu_fbffSliozaets1[6N)C C L| _^P ROTO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h_:S391I:M95P:L Enote: ]expanded from macro 'IMPL_COLL_FUNC'/ NCCL_S T391E | P S /RsuinzWeoorfk(, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested heree vredo p687< | t y p e > , N C C Lp_rAiLmGsO(_t#i#da-ltgiod,S tNaCrCtLB_cPaRsOtT,O _n#T#hprreoatdos>B(c)a.srtu,n (&&dnicrcelcSth-m>eomu.tw,o rnku)l;l p\t r ,| 
^a rgs->se/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:d562b:u15f:f ,note: field 'nthreads' will be initialized after field 'tidInBlock'a rgs- >562r | e c v b utfifd,( t i| d ^) , nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:a202d:s53(:n tnote: hin instantiation of member function 'RunWorkElement, 2, 2>::run' requested herer ead s202) | , t i d I n B lRoucnkW(otrhkrEelaedmIednxt.:(562):.60r:u nnote: (field 'group' will be initialized after field 'stepSize'w e); 562| | ^ ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppd:(11t:i1d:) ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested heren thre a11d | sI(MnPtLh_rCeOaLdLs_)F,U NtCi(dAIlnlBRleodcukc(et,h rCeOaLdLINdExT._xD)I,R EgCrTo,u pS(IgMrPoLuEp,) ,P r e| M ^~~~~~~~~~~u lSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | 
IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 
'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement() .562r | u n ( w et)i;d ( t| i ^d ), nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppa:d12s:(1n:t hnote: rin instantiation of member function 'RunWork, 2, 2>::run' requested heree ads), 12t | iIdMIPnLB_lCoOcLkL(_tFhUrNeCa(dAIldlxR.exd)u,c eg,r oCuOpL(LgNrEoTu_pD)I,R E C| T ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~, S| I tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)M PLE, 563P | r e M u lsStuemp,S idzoeu(bnlcec)l S h| m^e m.comm/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h.:b391u:f95f:S inote: zexpanded from macro 'IMPL_COLL_FUNC'e s[NCCL _391P | R O 
TROu_nSWIoMrPkL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h,: 655N:C11C:L _note: Ain instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested hereL GO_## a655l | g o , N C C L _ P RpOrTiOm_s#(#tpirdo-ttoi>d(S)t.arrutnR(e&dnucccel,S hnmTehmr.ewaodrskR)e;d u\c e ,| ^n ullpt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:,562 :&15d:i rnote: efield 'nthreads' will be initialized after field 'tidInBlock'c t->ou t562, | a r g st-i>ds(etniddb)u,f fn,t harregasd-s>(rnetchvrbeuafdfs,) , | t ^i dInBlo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hc:k202(:t53h:r enote: ain instantiation of member function 'RunWorkElement, 2, 2>::run' requested hered Idx .202x | ) , g r o u p (RgurnoWuopr)k,E l e| m ^~~~~~~~~~~~~~~~~e ntt(i)d.)r,u nn(twher)e;a d s| ( ^n thread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpps:)12,: 1t:i dnote: Iin instantiation of member function 'RunWork, 2, 2>::run' requested heren Bloc k12( | tIhMrPeLa_dCIOdLxL._xF)U,N Cg(rAolulpR(egdruocuep,) ,C O L| L ^~~~~~~~~~~N ET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hCL_PR:O562T:O15_:# #warning: pinitializer order does not match the declaration order [-Wreorder-ctor]r oto>().run 562 | ( & n ctcildS(htmiedm).,w onrtkh)r;e a\d s (| n ^t hread/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:)562,: 15t:i dnote: Ifield 'nthreads' will be initialized after field 'tidInBlock'n Block (562t | h r e a dtIiddx(.txi)d,) ,g rnotuhpr(egardosu(pn)t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d s )| , tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) tidI n563B | l o c k (sttherpeSaidzIed(xn.cxc)l,S hgmreomu.pc(ogmrmo.ubpu)f,f S i| z ^~~~~~~~~~~~~~~~~e s[N/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hC:C562L:_60P:R Onote: Tfield 'group' will be initialized after field 'stepSize'O _SIMP L562E | ] / N C CtLi_dS(TtEiPdS)/,s inztehorfe(aTd)s)( n{t h r| e ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a d s| ) group(group, tidInBlock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.ht:h677r:e11a:d Inote: din instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herex .x), g677r | o u p ( g r o u p ) ,p r i| m ^~~~~~~~~~~s (tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:]562/:N15C:C Lwarning: _initializer order does not match the declaration order [-Wreorder-ctor]S TEPS/sizeo f562( | T ) ) {t i d| ( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t i d| ) group(group, nthreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.he:a641d:s11):, note: tin instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herei dInBl o641c | k ( t h r e a d I d xp.rxi)m,s (gtriodu-pt(igdrSotuapr)t,R e d| u ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~c e ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)n Thread s563R | e d u c es,t edpiSriezcet(-n>cdcolwSnh,m e&md.icroemcmt.-b>uofuftS,i zaersg[sN-C>CsLe_nPdRbOuTfOf_,S IaMrPgLsE-]>/rNeCcCvLb_uSfTfE,P S /| s ^i zeof(T/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h):)202 :{53 : | note: ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here | group(group 202 | RunWor/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hk:E687l:e11m:e nnote: tin instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here< Fn, T, 687R | e d O p , A l g o ,p rPirmost(ot>i(d)-.triudnS(twaer)t;B c a| s ^t , nTh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppr:e11a:d1s:B cnote: ain instantiation of member function 'RunWork, 2, 2>::run' requested heres t, &11d | iIrMePcLt_-C>OoLuLt_,F UnNuCl(lApltlrR,e daurcges,- >CsOeLnLdNbEuTf_fD,I RaErCgTs,- >SrIeMcPvLbEu,f fP,r e M| u ^l Sum, float/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h): 202 :| 53^: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95 :202 | note: expanded from macro 'IMPL_COLL_FUNC' 391R | u n WRournkWEolrekmv(r)e.droupn<(twyep)e;> , | N ^C CL_ALGO_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp#:#12a:l1g:o ,note: in instantiation of member function 'RunWork, 2, 2>::run' requested hereN CCL_P R12O | TIOM_P#L#_pCrOoLtLo_>F(U)N.Cr(uAnl(l&RnecdculcSeh,m em.worCkO)L;L N\E T _| D ^I RECT,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :S562I:M15P:L Enote: ,field 'nthreads' will be initialized after field 'tidInBlock' PreM u562l | S u tmi,d (dtoiudb)l,e )n t h| r^e ads(n/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h391r:e95a:d snote: )expanded from macro 'IMPL_COLL_FUNC', tidIn B391l | o c kR(utnhWroerakd562, | N C C Lt_iAdL(GtOi_d#)#,a lngtoh,r eNaCdCsL(_nPtRhOrTeOa_d#s#)p,r ottiod>I(n)B.lroucnk((&tnhcrcelaSdhImdexm..xw)o,r kg)r;o u\p ( g| r ^o up), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h ^~~~~~~~~~~: 562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:T562):)15 :{ warning: initializer order does not match the declaration order [-Wreorder-ctor]| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hi:d666(:t9i:d )note: ,in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here nthre a666d | s ( n t h r e a dpsr),i mtsi(dtIindB,l oncTkh(rtehardesaGdaItdhxe.rx,) ,d igrreocutp-(>gurpo,u pN)U,L L ,| ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~a r g| s tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)- >send b563u | f f , asrtgesp-S>irzeec(vnbcucflfS,h m e| m ^. comm.bu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hf:f202S:i53z:e snote: [in instantiation of member function 'RunWorkElement, 2, 2>::run' requested hereN CCL_ P202R | O T O _ S I M P LREu]n/WNoCrCkLE_lSeTmEePnSt/()./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.hr:u677n:(11w:e )note: ;in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here | ^ 677 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp : 13 : 1 : note: in instantiation of member function 'RunWork, 2, 2>::run' requested here pri m13s | (ItMiPdL-_tCiOdLSLt_aFrUtNBCc(aAsltl,R endTuhcree,a dCsOBLcLaNsEtT,_ D&IdRiErCeTc,t -S>IoMuPtL,E ,d iPrreecMtu-l>Sduomw,n ,r cacrlg_sb-f>lsoeantd1b6u)f f ,| ^a rgs-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h>:r391e:c95v:b unote: fexpanded from macro 'IMPL_COLL_FUNC'f , | ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hR:u202n:W53o:r knote: , 2, 2>::run' requested heren cclF u202n | c # # f u n c , RtuynpWeo,r kFEulnecm#e#ndtep,, NAClCgLo_,A LPGrOo_t#o#>a(l)g.or,u nN(CwCeL)_;P R O| T ^O _##pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cppo:t12o:>1(:) .note: rin instantiation of member function 'RunWork, 2, 2>::run' requested hereu n(& n12c | cIlMSPhLm_eCmO.LwLo_rFkU)N;C (\A l l| R ^e duce,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :C562O:L15L:N Enote: Tfield 'nthreads' will be initialized after field 'tidInBlock'_ DIRECT ,562 | S I M P LtEi,d (PtriedM)u,l Snutmh,r edaodusb(lnet)h r e| a^d 
s),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :t391i:d95I:n Bnote: lexpanded from macro 'IMPL_COLL_FUNC'o ck(th r391e | a d IRduxn.Wxo)r,k d,( tNiCdC)L,_ AnLtGhOr_e#a#dasl(gnot,h rNeCaCdLs_)P,R OtTiOd_I#n#Bplrooctko(>t(h)r.eraudnI(d&xn.cxc)l,S hgmreomu.pw(ogrrko)u;p )\, | | ^ ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, CO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:L562N:E15T:_ Dwarning: Iinitializer order does not match the declaration order [-Wreorder-ctor]R ECT, SIMP L562E | , P r etMiudl(Stuimd,) ,d onutbhlree)a d s| (^n thre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:d391s:)95,: tnote: iexpanded from macro 'IMPL_COLL_FUNC'd InBloc k391( | t h rReuandWIodrxk.i,z eN(CnCcLc_lASLhGmOe_m#.#caolmgmo.,b uNfCfCSLi_zPeRsO[TNOC_C#L#_pPrRoOtToO>_(S)I.MrPuLnE(]&/nNcCcClLS_hSmTeEmP.Sw/osrikz)e;o f\( T )| ) ^ { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562| : group(group15 : note: field 'nthreads' will be initialized after field 'tidInBlock' 562/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h | : 666 : 9 :t inote: din instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here( tid), 666n | t h r e a d s ( nptrhirmesa(dtsi)d,, tniTdhIrneBaldoscGka(tthherre,a ddIidrxe.cxt)-,> ugpr,o uNpU(LgLr,o uapr)g,s - >| s ^~~~~~~~~~~~~~~~~e nd/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hb:u562f:f60,: anote: rfield 'group' will be initialized after field 'stepSize'g s->r e562c | v b u f ft,i d (| t ^i d), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hh:r202e:a53d:s (note: nin instantiation of member function 'RunWorkElement, 2, 2>::run' requested heret hrea d202s | ) , t i d I n BRluoncWko(rtkhErleeamdeIndtx<.Fxn),, Tg,r oRuepd(Ogpr,o uApl)g,o , | P ^~~~~~~~~~~r 
oto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order 
does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, 
ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx908. 67 warnings generated when compiling for gfx940. 67 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, 
uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for gfx90a. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: 
note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in 
instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidSta/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:t562S:c15a:t twarning: einitializer order does not match the declaration order [-Wreorder-ctor]r , nThread s562S | c a t t etri,d (NtUiLdL),, dnitrherceta-d>su(pn,t harregasd-s>)s,e ntdibduIfnfB,l oacrkg(st-h>rreeacdvIbduxf.fx,) , | g ^r oup(gr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ho:u202p:)53,: note: | in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 202 | 563 | R usntepSWiozrek(EnlcecmleSnhtmT(O)_.SrIuMnP(LwEe])/;N C C| L ^_ STEPS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp/:s4i:z1e:o fnote: (in instantiation of member function 'RunWork, 2, 2>::run' requested hereT )) { 4 | | I ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~M P L| _ group(groupC OLL_FUNC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h(:A677l:l11R:e dnote: uin instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested herec e, COL L677N | E T _ D I R E C T , pSrIiMmPsL(Et,i dP-rteiMduSltSaurmt,B cianstt8,_ tn)T h r| e^a dsBc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ha:s391t:,95 :& dnote: iexpanded from macro 'IMPL_COLL_FUNC'r ect->o u391t | , dRiurneWcotr-k>usnecn,d btuyfpfe,, aFrugnsc-#>#rdeecvvrbeudfofp,< t y| p ^e >, NC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hC:L202_:A53L:G Onote: _in instantiation of member function 'RunWorkElement, 
2, 2>::run' requested here# #alg o202, | N C C L _ P R ORTuOn_W#o#rpkrEolteom>e(n)t. ^( ).run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h(:w562e:)15;: note: | field 'nthreads' will be initialized after field 'tidInBlock' ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp562: | 6 : 1 : tnote: iin instantiation of member function 'RunWork, 2, 2>::run' requested hered (tid )6, | InMtPhLr_eCaOdLsL(_nFtUhNrCe(aAdlsl)R,e dtuicdeI,n BClOoLcLkN(EtTh_rDeIaRdEICdTx,. xS)I,M PgLrEo,u pP(rgerMouulpS)u,m , | i ^~~~~~~~~~~~~~~~~n t3/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h2:_562t:)60 : | note: ^field 'group' will be initialized after field 'stepSize' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h: 391562: | 95 : note: expanded from macro 'IMPL_COLL_FUNC't id(tid )391, | n tRhurneWaodrsk(g,r oNuCpC)L,_ A L| G ^~~~~~~~~~~O _##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 4 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53:
note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | 
prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 5 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 6 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/siz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 7 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 8 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 
'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 9 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 10 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer 
order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 11 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 
391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 12 | IMPL_COLL_FUNC(AllReduce, 
COLLNET_DIRECT, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:626:9: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 626 | prims(tid-tidStartScatter, nThreadsScatter, NULL, direct->up, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:641:11: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 641 | prims(tid-tidStartReduce, nThreadsReduce, direct->down, &direct->out, args->sendbuff, args->recvbuff, | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:655:11: note: in instantiation of member function 
'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 655 | prims(tid-tidStartReduce, nThreadsReduce, nullptr, &direct->out, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:666:9: note: in instantiation of member function 'Primitives, FanAsymmetric<7, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 666 | prims(tid, nThreadsGather, direct->up, NULL, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:677:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 7>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 677 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, direct->down, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/all_reduce.h:687:11: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 0>::Primitives' requested here 687 | prims(tid-tidStartBcast, nThreadsBcast, &direct->out, nullptr, args->sendbuff, args->recvbuff, | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 2, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 2, 2>::run' requested here 13 | IMPL_COLL_FUNC(AllReduce, COLLNET_DIRECT, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 67 warnings generated when compiling for host. 67 warnings generated when compiling for gfx900. 67 warnings generated when compiling for gfx803. 67 warnings generated when compiling for gfx1102. 67 warnings generated when compiling for gfx1101. 67 warnings generated when compiling for gfx1100. 67 warnings generated when compiling for gfx906. 67 warnings generated when compiling for gfx1030. 67 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o -MF 
CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp 
+ 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: 
warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | 
runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template 
specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, 
FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:212:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 212 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested 
here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:224:9: note: in instantiation of function template 
specialization 'RunWork, 1, 2>::runSend>' requested here 224 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:212:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 212 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 8>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:224:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 224 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for gfx908. 9 warnings generated when compiling for gfx906. 9 warnings generated when compiling for gfx1100. 9 warnings generated when compiling for gfx940. 9 warnings generated when compiling for gfx1030. 9 warnings generated when compiling for gfx900. 9 warnings generated when compiling for gfx1101. 9 warnings generated when compiling for gfx1102. 9 warnings generated when compiling for gfx941. 9 warnings generated when compiling for gfx803. 9 warnings generated when compiling for gfx90a. 9 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:214:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 214 | runRecv>(tid, nthreads, group, args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:149:62: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 0>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 149 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:216:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runRecv>' requested here 216 | runRecv>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1, 4>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:226:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 226 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:87:62: note: in instantiation of member function 'Primitives, FanAsymmetric<0, 1>, 0, ProtoSimple<1, 1>, 1>::Primitives' requested here 87 | Primitives, 0, Proto, 1> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/sendrecv.h:228:9: note: in instantiation of function template specialization 'RunWork, 1, 2>::runSend>' requested here 228 | runSend>(tid, nthreads, group, args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(SendRecv, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 9 warnings generated when compiling for host. 9 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, 
int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr mmetric<1,1>, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded 
from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, 
int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ :134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, 
int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is 
used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ rc, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' 
to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will 
be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | 
^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, 
int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o.d -o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set 
but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 
271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 
| copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be 
initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 
| void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 
| void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx906. 1515 warnings generated when compiling for gfx90a. warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 88%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested 
here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded 
from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | 
void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, 
dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 
405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: 
note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h169:: 154/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h::10509:: 29warning: :variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 154 | c a507s | e 3 : t i| d ^( tid), 
nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cppr:e5a:d9s:( nnote: tin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested hereh read s5) | , w i d ( t i dM%SWCACRLP__ISMIPZLE_)K,E RwNaErLp_(EtNiTdR/YW_AFRUPN_CS_IDZEEV)R,E D O| P ~~~~~~~~~~~~~~~~~~_ T Y| P stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)E (Sum ,508 | u i n t 8w_atr,p IfnaBllsoec)k;( t h| r ^e adIdx/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h.:x402/:W3A:R Pnote: _expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'S IZE), 402| | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ m| s warp(tid/WARP_SIZEc clRu n509I | n t e r pfrleatgeTrho,u pP)r,o t o| L ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~L 1 2| 8 warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3, full O510p | s > ( c osmtme,p Sailzgeo(,n cwcolrSkh)m;e m\. c o| m ^m .bu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hf:f165S:i33z:e snote: [uninitialized use occurs hereN CCL_P R165O | T O _ L Lc1o2p8y]T/oNSChCmLe_mS8T(EtPiSd/%sWiAzRePo_fS(IuZiEn,t 6d4s_tt,) )s r{c , | b ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y t e| s group(group) ; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::217162::575:: note: warning: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested herevariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | 217 | Pdreifmaiutlitv:e s <| T ^~~~~~~, R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:d165O:p33,: Fnote: auninitialized use occurs heren Asym m165e | t 
r i c T,o S1h,m ePmr8o(ttoi,d %0W>A RpPr_iSmIsZ E ,| ^d st, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpps:r5c:,9 :b ynote: tin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested heree s); 5| | ^~~ MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hL:11342:814,: fnote: uinitialize the variable 'dst' to silence this warningl lOp s134> | ( c o m mv,o iadl g*od,s tw,o r*ks)r;c ;\ | | ^ ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h*:s154r:c10;: warning: | variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] ^ | = nullptr154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple ,562 | f u l l Otpisd>((tciodm)m,, natlhgroe,a dwso(rnkt)h;r e\a d s| ) ^, tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hI:n165B:l33o:c knote: (uninitialized use occurs heret hread I165d | x . x ) ,c ogpryoTuopS(hgmreomu8p()t,i d %| W ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~A R P| _ tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)S IZE, d563s | t , s rsct,e pbSyitzees()n;c c l| S ^~~h mem.c/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ho:m162m:.5b:u fwarning: fvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]S iz e162s | [ N C C Ld_ePfRaOuTlOt_:S I M| P ^~~~~~~L E]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/:N165C:C33L:_ Snote: Tuninitialized use occurs hereE PS/s i165z | e o f ( Tc)o)p y{T o S| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m e m| 8 group(group( tid%WARP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h_:S217I:Z57E:, note: din instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested heres t, s r217c | , bPyrtiemsi)t;i v e| s ^~~< T, RedOp, FanAsymmetric<1,1>, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, 
uint8_t,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :f134a:l14s:e )note: ;initialize the variable 'dst' to silence this warning | ^ 134 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 405v:o3i:d note: *expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'd st, *src ;405 | | ^m s c| c = nullptrl RunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tiIn file included from d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp(:t1i: dIn file included from )/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h,: 13n: tIn file included from h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hr:e167a: d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hs:(562n:t15h:r ewarning: ainitializer order does not match the declaration order [-Wreorder-ctor]d s), tidI n562B | l o c k (ttihdr(etaiddI)d,x .nxt)h,r egardosu(pn(tghrroeuapd)s,) , | t ^~~~~~~~~~~~~~~~~i dIn/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hB:l562o:c60k:( tnote: hfield 'group' will be initialized after field 'stepSize'r eadId x562. 
| x ) , gtriodu(pt(igdr)o,u pn)t,h r e| a ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~d s (| n tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)t hread s563) | , t i dsItneBplSoiczke((tnhcrcelaSdhImdexm..xc)o,m mg.rbouufpf(Sgirzoeusp[)N,C C L| _ ^~~~~~~~~~~P ROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 
154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 
15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group),In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: 
expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1030. 
15 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 15 warnings generated when compiling for host. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
[ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o
/usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1:
In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 
'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | 
void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: 
note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter<type, Func##devredop<type>, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ (tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is 
used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 
| MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default 
is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 
154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx908. 
15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 89%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' 
is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case 
is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in 
instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: 
note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, 
uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 
'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 
| void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after 
field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: 
variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | 
default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primiti 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr ves, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' 
requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 
'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthrea/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] ds(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 
507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 
15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case 
is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
[ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o
/usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1:
In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' 
is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of 
function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: 
in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is 
used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will 
be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in 
instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is 
used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, 
ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: 
variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, 
FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, 
src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the 
variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 
mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: 
warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, 
false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, 
FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncSum<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | 
^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 
| copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 
| copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: 
note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of 
member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: 
variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 
399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, 
false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, 
double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 90%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from 
macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: 
note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded 
from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used 
uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp| : ^~~~~~~~~~~1 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, 
ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation 
of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Sum, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 
| group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: 
note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of 
function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | 
^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 
| mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, 
src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | defaIn file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217note: :initialize the variable 'dst' to silence 
this warning57 : note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here134 | voi d217 | * d sPtr,i m*istricv;e s <| T ^, R| e = nullptrd Op, FanAsymmetric<1,1>, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreterfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor], ProtoS i507m | p l e < MtSiCdC(Lt_iCdH)U,N KnStThErPeSa/dMsS(CnCtLh_rSeLaIdCsE)S,T EwPiSd,( tMiSdC%CWLA_RSPL_ISCIEZSET)E,P Sw>a,r pf(utlildO/pWsA>R(Pc_oSmImZ,E )a,l g o| , ~~~~~~~~~~~~~~~~~~ w o| r stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)k ); \ 508 | | ^ warpI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:B562l:o15c:k (note: tfield 'nthreads' will be initialized after field 'tidInBlock'h readI d562x | . 
x / W AtRiPd_(StIiZdE)),, n t| h ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r e a| d warp(tid/WARP_SIZEs (nth r509e | a d s ) ,f ltaigdTIhnrBelaodc(k((ttihdr%e4a)d=I=d3x).,x )g,r ogurpo(ugpr(ogurpo)u,p ) ,| ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | ^~~~~~~~~~~~~~~~~| warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562 :51060 | : note: field 'group' will be initialized after field 'stepSize' stepSi z562e | ( n c c ltSihdm(etmi.dc)o,m mn.tbhurfefaSdisz(enst[hNrCeCaLd_sP)R,O TtOi_dLILn1B2l8o]c/kN(CtChLr_eSaTdEIPdSx/.sxi)z,e ogfr(ouuipn(tg6r4o_utp))), { | ^~~~~~~~~~~| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uintIn file included from 8/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp_:t1,: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hf:a154l:s10e:) ;warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] | ^ 154/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h | : 402 : 3 :c anote: sexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'e 3: | 402 ^ | mscclRunI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cppn:t5e:r9p:r 
enote: tin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested heree rK,E RPNrEoLt_oELNLT1R2Y8_,F UfNuCl_lDOEpVsR>E(DcOoPm_mT,Y PaEl(gPor,o dw,o ruki)n;t 8\_ t ,| ^f alse); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr ult: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 
'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of 
function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. 
15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case 
is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | 
case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 
| uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in 
instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 
'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = 
nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: 
initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 
507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' 
requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 
507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 
15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize 
the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set 
but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused 
variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' 
set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, 
uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default 
is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | 
void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx940. 
15 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; 
| ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch 
case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; 
| ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of 
member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: 
note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation 
of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does 
not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWorke,a dNsC(CnLt_hArLeGaOd_s#), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ #algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of 
member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in 
instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' 
requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: 
in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 
'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, 
SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx908. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx941. 13 warnings generated when compiling for gfx90a. 13 warnings generated when compiling for gfx803. 13 warnings generated when compiling for gfx900. 13 warnings generated when compiling for gfx1100. 13 warnings generated when compiling for gfx940. 13 warnings generated when compiling for gfx1030. 13 warnings generated when compiling for gfx1102. 13 warnings generated when compiling for gfx906. 13 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for host. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, SumPostDiv, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 13 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' 
requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized 
whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: 
field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPLIn file included from 
_KERNE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cppL:_1E: NT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hR:Y154_:F10U:N Cwarning: _variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]D EVREDOP_TY P154E | ( P r o dc,a sien t36:4 _ t| , ^ false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cppnote: :expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'5 :9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 405 | 5 | m s c c l R u n IMnStCeCrLp_rIeMtPeLr__,T YPPrEo(tPorSoidm,p lienn,I nftuelrlpOrpest>e(rc,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :P562r:o15t:o Lnote: Lfield 'nthreads' will be initialized after field 'tidInBlock', ful l562O | p s > ( ctoimdm(,t iadl)g,o ,n twhorreka)d;s (\n t h| r ^e ads), ti/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hd:I165n:B33l:o cnote: kuninitialized use occurs here( thread I165d | x . x ) ,c ogpryoTuopS(hgmreomu8p()t,i d %| W ^~~~~~~~~~~~~~~~~A RP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h_:S562I:Z60E:, note: dfield 'group' will be initialized after field 'stepSize's t, s r562c | , b y tteisd)(;t i d| ) ^~~, nthreads(nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hh:r162e:a5d:s )warning: ,variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] tid I162n | B l o c kd(etfharuelatd:I d x| . 
^~~~~~~x ),/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :g165r:o33u:p (note: guninitialized use occurs herer oup) ,165 | | ^~~~~~~~~~~ copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the 
variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: 
field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case 
is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 
0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 
15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case 
is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
[ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o
/usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1:
In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from 
macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, 
false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 
15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1102. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 91%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset 
= tid; | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 
| int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; 
| ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 
'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 
509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); 
\ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, 
false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 
'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of 
function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 
0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = 
nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: 
warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 
mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: 
in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 
0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | 
void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx90a. 
15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch 
case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 15 warnings generated when compiling for host. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int 
offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 
0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :165 | 154 : 10 : cwarning: ovariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]p yToShm e154m | 8 ( t i dc%aWsAeR P3_:S I Z| E ^, dst, sr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cppc:,5 :b9y:t enote: sin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here) ; | ^~~5 | MSC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hC:L162_:I5M:P Lwarning: _variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]K ERN E162L | _ E N T RdYe_fFaUuNlCt_:D E V| R ^~~~~~~E DO/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hP:_165T:Y33P:E (note: Puninitialized use occurs herer od, d165o | u b l e ,c ofpaylTsoeS)h;m e m| 8 ^( tid%/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hW:A405R:P3_:S Inote: Zexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'E , dst, 405s | r c ,m sbcyctleRsu)n;I n t| e ^~~r preter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hW:A134R:P14_:S Inote: Zinitialize the 
variable 'dst' to silence this warningE , d s134t | , s r cv,o ibdy t*edss)t;, *| s ^~~r c; | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 162| : = nullptr5 : warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h167:: 134/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::14562:: 15note: :initialize the variable 'dst' to silence this warning warning: initializer order does not match the declaration order [-Wreorder-ctor] 134 | v o562i | d * d stti,d (*tsirdc);, n| t ^h r e| a = nullptrd s(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives , 1, Prot o562, 
| 0 > ptriidm(st i d| ) ^, nthre/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cppa:d5s:(9n:t hnote: rin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested heree ads) ,5 | t i d I n B l o cMkS(CtChLr_eIaMdPILd_xK.ExR)N,E Lg_rEoNuTpR(Yg_rFoUuNpC)_,D E V| R ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~E D O| P tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)_ TYPE( P563r | o d , dsotuebplSei,z ef(anlcscel)S;h m e| m ^. comm.b/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hu:f405f:S3i:z enote: sexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'[ NCCL_PROT O405_ | S I MmPsLcEc]l/RNuCnCILn_tSeTrEpPrSe/tseirz/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h,: 217P:r57o:t onote: Sin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested herei mple <217M | S C CPLr_iCmHiUtNiKvSeTsEL,I C1E,S TPErPoSt>o,, f0u>l lpOrpism>s( c o| m ^m , alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cppo:,5 :w9o:r knote: )in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here; \ | 5 ^ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562M:S15C:C Lnote: _field 'nthreads' will be initialized after field 'tidInBlock'I MPL_KE R562N | E L _ E NtTiRdY(_tFiUdN)C,_ DnEtVhRrEeDaOdPs_(TnYtPhEr(ePardosd),, dtoiudbIlneB,l ofcakl(steh)r;e a d| I ^d x.x)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h,: 405g:r3o:u pnote: (expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'g roup), 405 | | ^~~~~~~~~~~~~~~~~ 
mscc/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hl:R562u:n60I:n tnote: efield 'group' will be initialized after field 'stepSize'r preter <562t | y p e , tFiudn(ct#i#dd)e,v rnetdhorpet,h rPeraodtso)S,i mtpildeI, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, 
double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx941. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 92%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, 
ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | ca/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ se 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of 
function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 
'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, 
FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, 
dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | 
default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in 
instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = 
nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WA/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | 
= nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but 
not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized 
whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, 
src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 
| mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncProd<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter15 warnings generated when compiling for host. 
, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: 
in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: 
in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBloIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ck(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = 
tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, 
SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: 
expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 
78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ri/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ng->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 
78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ p, Proto>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ m.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' 
will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, 
Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' 
requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hi:d562(:t15i:d )warning: ,initializer order does not 
match the declaration order [-Wreorder-ctor] nthreads( n562t | h r e a dtsi)d,( ttiidd)I,n Bnltohcrke(atdhsr(enatdhIrdexa.dxs)),, gtrioduInpB(lgorcoku(pt)h,r e a| d ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~I d x| . tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)x ), gr o563u | p ( g r osutpe)p,S i z| e ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~( n c| c tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)l Shmem .563c | o m m . bsutfefpSSiizzees([nNcCcClLS_hPmReOmT.Oc_oSmImM.PbLuEf]f/SNiCzCeLs_[SNTCECPLS_/PsRiOzTeOo_fS(ITM)P)L E{] / N| C ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C L _| S group(groupT EPS/sizeof/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h(:T33):)7 :{ note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 33 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h :p33r:i7m:s (note: tin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herei d, nth r33e | a d s , & rpirnigm-s>(ptriedv,, n&trhirnega-d>sn,e x&tr,i nagr-g>sp-r>esve,n d&bruifnfg,- >anregxst-,> raercgvsb-u>fsfe,n dabrugfsf-,> raerdgOsp-A>rrge,c v0b,u fafr,g sa-r>gcso-n>nrIenddOepxA,r ga,r g0s,- >acrognsn-I>ncdoenxn)I;n d e| x ^, args-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h>:c78o:n5n:I nnote: din instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested heree x); 78| | ^ ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hn:R78i:n5g:< Tnote: ,in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here Red O78p | , P r ortuon>R(ianrgg:(202a:r53g:s )note: ;in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here | ^ 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 : Rnote: uin instantiation of member function 'RunWorkElement, 1, 2>::run' requested heren Wor k202E | l e m e n t < F nR,u nTW,o rRkeEdlOepm,e nAtld(O)p.,r uAnl(gwoe,) ;P r o| t ^o >().run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp(:w11e:)1;: note: | in instantiation of member function 'RunWork, 1, 2>::run' requested here ^ 11 | IM/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cppP:L12_:C1O:L Lnote: _in instantiation of member function 'RunWork, 1, 2>::run' requested hereF UNC( R12e | dIuMcPeLS_cCaOtLtLe_rF,U NRCI(NRGe,d uScIeMSPcLaEt,t ePrr,o dR,I NfGl,o aStI)M P L| E^, Prod/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h,: 391d:o95u:b lnote: eexpanded from macro 'IMPL_COLL_FUNC') | ^ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 391R:u95n:W onote: rexpanded from macro 'IMPL_COLL_FUNC'k n,c #N#CdCeLv_rAeLdGoOp_<#t#yapleg>o,, NNCCCCLL__APLRGOOT_O#_##a#lpgroo,t oN>C(C)L._rPuRnO(T&On_c#c#lpSrhomteom>.(w)o.rrku)n;( &\n c c| l ^S hmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hw:o562r:k15):; note: \field 'nthreads' will be initialized after field 'tidInBlock' | ^ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 :t15i:d (note: tfield 'nthreads' will be initialized after field 'tidInBlock'i d), n t562h | r e a d st(indt(htrieda)d,s )n,t htriedaIdnsB(lnotchkr(etahdrse)a,d Itdixd.Ixn)B,l 
ogcrko(utph(rgeraoduIpd)x,. x )| , ^~~~~~~~~~~~~~~~~ gro/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hu:p562(:g60r:o unote: pfield 'group' will be initialized after field 'stepSize') , | ^~~~~~~~~~~~~~~~~ 562 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562t:i60d:( tnote: ifield 'group' will be initialized after field 'stepSize'd ), nt h562r | e a d s (tidn(tthirde)a,d sn)t,h rteiaddIsn(Bnltohcrke(atdhsr)e,a dtIiddxI.nxB)l,o cgkr(otuhpr(egardoIudpx).,x ) ,| ^~~~~~~~~~~g roup(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1102. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation 
of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1030. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx906. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx803. 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' 
requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 
2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, 
FuncProd<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncProd<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Prod, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 86%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 
'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | 
uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ reads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h7::562 :note: 15in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here: warning: initializer order does not match the declaration order [-Wreorder-ctor] 33 | 562 | p r i m st(itdi(dt,i dn)t,h rnetahdrse,a d&sr(inntgh-r>epardesv),, &triidnIgn-B>lnoecxkt(,t harregasd-I>dsxe.nxd)b,u fgfr,o uapr(ggsr-o>urpe)c,v b u| f ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~f , | a tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)r gs-> r563e | d O p A rsgt,e p0S,i zaer(gnsc-c>lcSohnmneImn.dceoxm,m .abrugfsf-S>iczoensn[INnCdCeLx_)P;R O T| O ^_ SIMPLE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h]:/78N:C5C:L _note: Sin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested hereT EPS /78s | i z e o fr(uTn)R)i n{g < T| , ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ R e| d group(groupO p, Proto>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h(:a33r:g7s:) ;note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here | ^ 33 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h : 202 : 53 :p rnote: iin instantiation of member function 'RunWorkElement, 1, 2>::run' requested herem s(ti d202, | n t h r e a d sR,u n&WroirnkgE-l>epmreenvt,< F&nr,i nTg,- >RneedxOtp,, aArlggso-,> sPernodtbou>f(f),. 
raurng(sw-e>)r;e c v| b ^u ff, ar/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cppg:s7-:>1r:e dnote: Oin instantiation of member function 'RunWork, 1, 2>::run' requested herep Arg, 70 | ,I MaPrLg_sC-O>LcLo_nFnUINnCd(eRxe,d uacregSsc-a>tctoenrn,I nRdIeNxG),; S I| M ^P LE, Su/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hm:,78 :u5i:n tnote: 3in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here2 _t) 78 | | ^ ru/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:R391i:n95g:< Tnote: ,expanded from macro 'IMPL_COLL_FUNC' RedOp, 391P | r o tRou>n(Waorrgks<)n;c c l| F ^u nc##fu/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:c202,: 53t:y pnote: ein instantiation of member function 'RunWorkElement, 1, 2>::run' requested here, Fun c202# | # d e v r e d o pRk,E lNeCmCeLn_tAr(o)t.or>u(n)(.wreu)n;( & n| c ^c lShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.:w8o:r1k:) ;note: in instantiation of member function 'RunWork, 1, 2>::run' requested here\ | ^ 8 | IMP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hL:_562C:O15L:L _note: Ffield 'nthreads' will be initialized after field 'tidInBlock'U NC(Red u562c | e S c a tttiedr(,t iRdI)N,G ,n tShIrMePaLdEs,( nStuhmr,e aidnst)6,4 _tti)d I n| B^l ock(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:h391r:e95a:d Inote: dexpanded from macro 'IMPL_COLL_FUNC'x .x), gr o391u | p ( gRruounpW)o,r k <| n ^~~~~~~~~~~~~~~~~c clFun/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hc:#562#:f60u:n cnote: ,field 'group' will be initialized after field 'stepSize' type, F562u | n c # # dteivdr(etdiodp)<,t ynpteh>r,e 
aNdCsC(Ln_tAhLrGeOa_d#s#)a,l gtoi,d INnCBClLo_cPkR(OtThOr_e#a#dpIrdoxt.ox>)(,) .grruonu(p&(ngcrcoluSph)m,e m .| w ^~~~~~~~~~~o rk); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##alg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRoi,n gNo(>a(r)g.sr)u;n ( &| n ^c clShmem.work);/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h :\202 : 53| : ^ note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562 : 15 : note: Rfield 'nthreads' will be initialized after field 'tidInBlock'u nWorkEle m562e | n t < F nt,i dT(,t iRde)d,O pn,t hArlegaod,s (Pnrtohtroe>a(d)s.)r,u nt(iwdeI)n;B l o| c ^k (threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cppd:I9d:x1.:x )note: ,in instantiation of member function 'RunWork, 1, 2>::run' requested here grou p9( | gIrMoPuLp_)C,O L L| _ ^~~~~~~~~~~~~~~~~F UNC(R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.he:d562u:c60e:S cnote: afield 'group' will be initialized after field 'stepSize't ter, R I562N | G , S ItMiPdL(Et,i dS)u,m ,n tuhirneta6d4s_(tn)t h r| e^a ds), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.ht:i391d:I95n:B lnote: oexpanded from macro 'IMPL_COLL_FUNC'c k(thre a391d | I d xR.uxn)W,o rgkr, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ izeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ).run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement() .562r | u n ( w et)i;d ( t| i ^d ), nthreads(nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cppr:e10a:d1s:) ,note: in instantiation of member function 'RunWork, 1, 2>::run' requested heret idInB l10o | cIkM(PtLh_rCeOaLdLI_dFxU.NxC)(,R egdruocuepS(cgartotuepr),, R I| N ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~G , | S tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)I MPLE, 563S | u m , hsatlefp)S i z| e^( ncclSh/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hm:e391m:.95c:o mnote: mexpanded from macro 'IMPL_COLL_FUNC'. 
buffSiz e391s | [ N CRCuLn_WPoRrOkT, NCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h_:A33L:G7O:_ #note: #in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herea lgo, N C33C | L _ P R O T Op_r#i#mpsr(ottiod>,( )n.trhurne(a&dnsc,c l&Srhimnegm-.>wporrekv),; &\r i n| g ^- >next, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hr:g562s:-15>:s enote: nfield 'nthreads' will be initialized after field 'tidInBlock'd buff, a562r | g s - > rteicdv(btuifdf),, anrtghsr-e>ardesd(OnptAhrrge,a d0s,) ,a rtgisd-I>ncBolnoncIkn(dtehxr,e aadrIgdsx-.>xc)o,n ngIrnoduepx()g;r o u| p ^) , | ^~~~~~~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h :78:5/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:: 562note: :in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here60 : note: field 'group' will be initialized after field 'stepSize'78 | r562u | n R i n gts((anrtghsr)e;a d s| ) ^, tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hn:B202l:o53c:k (note: tin instantiation of member function 'RunWorkElement, 1, 2>::run' requested hereh read I202d | x . 
x ) , g r oRuupn(WgorrokuEpl)e,m e n| t ^~~~~~~~~~~< Fn, T, RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, 
args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' 
will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceS/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ catter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatargs->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template 
specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, 
Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will 
be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx940. 
17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 17 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' 
requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 
0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(R/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ educeScatter, RING, SIMPLE, Sum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncSum<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncSum<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Sum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 
--offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | 
prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 
| uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of 
member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 
| uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 
'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' 
[-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, 
int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested 
here 202 | RunWorkElement, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, 
FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: 
note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | 
runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested herereads, 
&ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :562:15: warning: initializer order does not match 
the declaration order [-Wreorder-ctor] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: 562note: | in instantiation of member function 'RunWork, 1, 2>::run' requested here ti d11( | tIiMdP)L,_ CnOtLhLr_eFaUdNsC((nRtehdruecaedSsc)a,t tteird,I nRBIlNoGc,k (StIhMrPeLaEd,I dMxa.xx,) ,f lgoraotu)p ( g| r^o up), | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~: 391 :| 95 tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T): note: expanded from macro 'IMPL_COLL_FUNC' 563 | s t391e | p S iRzuen(WnocrcklC,L _NSCTCELP_SA/LsGiOz_e#o#fa(lTg)o), {N C C| L ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~_ P R| O group(groupT O_##proto>(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h):.33r:u7n:( ¬e: nin instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herec clShmem .33w | o r k ) ; \p r i| m ^s (tid, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:t562h:r15e:a dnote: sfield 'nthreads' will be initialized after field 'tidInBlock', &ring -562> | p r e v ,t i&dr(itnigd-)>,n enxtth,r eaardgss(-n>tshernedabdusf)f,, tairdgIsn-B>lroecckv(btuhfrfe,a daIrdgxs.-x>)r,e dgOrpoAurpg(,g r0o,u pa)r,g s -| > ^~~~~~~~~~~~~~~~~c onnI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hn:d562e:x60,: anote: rfield 'group' will be initialized after field 'stepSize'g s->co n562n | I n d e xt)i;d ( t| i ^d ), nt/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hh:r78e:a5d:s (note: nin instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested heret hre a78d | s ) , 
triudnIRniBnlgo (garrogusp)(;g r o| u ^p ), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ Op, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 
78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will 
be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member
function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 
args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo,
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:In file included from 33:7/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:: 1note: : in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested hereIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h: 56233: | 15 : warning: initializer order does not match the declaration order [-Wreorder-ctor] prims(tid, nt h562r | e a d s ,t i&dr(itnigd-)>,p rnetvh,r e&ardisn(gn-t>hnreexatd,s )a,r 
gtsi-d>IsneBnldobcukf(ft,h raeragdsI-d>xr.exc)v,b ugfrfo,u pa(rggrso-u>pr)e,d O p| A ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~r g ,| tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)0 , ar g563s | - > c o nsntIenpdSeixz,e (anrcgcsl-S>hcmoenmn.Icnodmemx.)b;u f f| S ^i zes[NCCL_PROT/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.hO:_78S:I5M:P Lnote: Ein instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here] /NC C78L | _ S T E PrSu/nsRiiznego(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h33::2027::53 :note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested herenote: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 20233 | | p rRiumnsW(otrikdE,l enmtehnrtepp,r eAvl,g o&,r iPnrgo-t>on>e(x)t.,r uanr(gwse-)>;s e n| d ^b uff, arg/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpps:-9>:r1e:c vnote: bin instantiation of member function 'RunWork, 1, 2>::run' requested hereu ff, a9r | gIsM-P>Lr_eCdOOLpLA_rFgU,N C0(,R eadrugcse-S>ccaotntneIrn,d eRxI,N Ga,r gSsI-M>PcLoEn,n IMnadxe,x )u;i n t| 6 ^4 _t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h::78391::595:: note: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested hereexpanded from macro 'IMPL_COLL_FUNC' 78 | 391 | rRuunnRWionrgk< (tayrpges,) ;F u n| c ^# 
#devr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.he:d202o:p53<:t ynote: pin instantiation of member function 'RunWorkElement, 1, 2>::run' requested heree >, N202C | C L _ A L G O _ #R#uanlWgoor,k ENlCeCmLe_nPtRp(,) .Arlugno(,& nPcroto>c(l)S.hrmuenm(.wweo)r;k ) ;| ^\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hnote: :in instantiation of member function 'RunWork, 1, 2>::run' requested here562 :15: note: field 'nthreads' will be initialized after field 'tidInBlock' 4 | 562I | M P L _ CtOiLdL(_tFiUdN)C,( RnetdhurceeaSdcsa(tnttehrr,e aRdIsN)G,, tSiIdMIPnLBEl,o cMka(xt,h rienatd8I_dtx). x )| ,^ grou/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.hp:(391g:r95o:u pnote: )expanded from macro 'IMPL_COLL_FUNC', | ^~~~~~~~~~~~~~~~~ 391 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h : 562R:u60n:W onote: rfield 'group' will be initialized after field 'stepSize'k i,d INnCBClLo_cAkL(GtOh_r#e#aadlIgdox,. 
xN)C,C Lg_rPoRuOpT(Og_r#o#uppr)o,t o >| ( ^~~~~~~~~~~) .run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested 
here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 
'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized 
after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' 
requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 
2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 
0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will 
be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1101. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx1100. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx900. 
17 warnings generated when compiling for gfx803. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1030. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in 
instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMax<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Max, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: 
in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member 
function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in 
instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) 
{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 
0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function 
template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 
| IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 
| runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , RedOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(groIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 
| stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ up), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member 
function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous 
namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration 
order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, 
FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: 
note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ edOp, Algo, Proto>().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 
'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, 
NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem./usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(Ring(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will 
be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ ), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 
| IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 
78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | 
RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 
202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx900. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx906. 17 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx1100. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, 
uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, 
args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncMin<__half>, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, Min, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 87%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ tepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested 
here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member 
function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of 
function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid),
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | 
IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested 
here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' 
requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of 
member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ roto>(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | 
prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid)/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 
args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, a/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), grrgs->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ oup(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, 
ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | 
tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 
'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ eads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, 
args-/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ >sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after 
field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, 
args->redOpAr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ g, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, 
NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | 
RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, 
args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) 
{ | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx908. 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx941. 17 warnings generated when compiling for gfx940. 17 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx90a. 17 warnings generated when compiling for gfx1030. 17 warnings generated when compiling for gfx900. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, 
FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ 
~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: 
note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 
'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 
2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 
0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for gfx1102. 17 warnings generated when compiling for gfx1100. /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded 
from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 
'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, 
args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 17 warnings generated when compiling for gfx803. 17 warnings generated when compiling for gfx1101. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:10: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:4:1: note: in instantiation of member function 'RunWork, 1, 
2>::run' requested here 4 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, 
ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:5:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 5 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint8_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in 
instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:6:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 6 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:7:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 7 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint32_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:8:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 8 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, int64_t) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:9:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 9 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, uint64_t) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives<__half, 
FuncPreMulSum, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing<__half, FuncPreMulSum, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:10:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 10 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, half) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | 
^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:11:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 11 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, float) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:12:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 12 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, double) | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:33:7: note: in instantiation of member function 'Primitives, FanSymmetric<1>, 0, ProtoSimple<2, 2>, 0>::Primitives' requested here 33 | prims(tid, nthreads, &ring->prev, &ring->next, args->sendbuff, args->recvbuff, args->redOpArg, 0, args->connIndex, args->connIndex); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/reduce_scatter.h:78:5: note: in instantiation of function template specialization '(anonymous namespace)::runRing, ProtoSimple<2, 2>>' requested here 78 | runRing(args); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:202:53: note: in instantiation of member function 'RunWorkElement, 1, 2>::run' requested here 202 | RunWorkElement().run(we); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp:13:1: note: in instantiation of member function 'RunWork, 1, 2>::run' requested here 13 | IMPL_COLL_FUNC(ReduceScatter, RING, SIMPLE, PreMulSum, rccl_bfloat16) | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/common.h:391:95: note: expanded from macro 'IMPL_COLL_FUNC' 391 | RunWork, NCCL_ALGO_##algo, NCCL_PROTO_##proto>().run(&ncclShmem.work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 17 warnings generated when compiling for host. 17 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | 
int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized 
whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this
warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 
'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: 
warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; 
| ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | 
warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used 
uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, 
ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: 
initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' 
will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: 
expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, 
ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from 
macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: 
note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Prod, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; 
| ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | 
^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 
2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, 
int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | 
default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | 
int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; 
| ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 
0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.co/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ mm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TY/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded 
from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ PE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' 
to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 
15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' 
to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hIn file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp153::128: :In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hwarning: :unused variable 'data2' [-Wunused-variable]13 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h: 168153: | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h : 153 : 14u:i nwarning: tunused variable 'data1' [-Wunused-variable]3 2_t data1, 153f | l a g 1 ,u idnatt3a22_,t fdlaatga21;, f| l ^~~~~a g1, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hd:a153t:a352:, warning: funused variable 'flag2' [-Wunused-variable]l ag2; 153 | | ^~~~~ ui/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hn:t1533:221_:t warning: dunused variable 'flag1' [-Wunused-variable]a ta1, 153f | l a g 1 ,u idnatt3a22_,t fdlaatga21;, f| l ^~~~~a g1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch 
case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple13,: In file included from f/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hu:l168l: O/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hp:s153>:(14c:o mwarning: munused variable 'data1' [-Wunused-variable], algo, wo r153k | ) ; \ u i| n ^t 32_t /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:a562t:a151:, note: ffield 'nthreads' will be initialized after field 'tidInBlock'l ag1, d a562t | a 2 , ftliadg(2t;i d )| , ^~~~~ nth/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.hr:e153a:d21s:( nwarning: tunused variable 'flag1' [-Wunused-variable]h rea d153s | ) , t iudiInntB3l2o_ctk (dtahtrae1a,d Ifdlxa.gx1),, dgartoau2p,( gfrloaugp2);, | | ^~~~~ ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h::153562::2860:: warning: note: unused variable 'data2' [-Wunused-variable]field 'group' will be 
initialized after field 'stepSize' 153 | 562 | u itnitd3(2t_itd )d,a tnat1h,r efaldasg(1n,t hdraetaad2s,) ,f ltaigd2I;n B l| o ^~~~~c k(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.ht:h153r:e35a:d Iwarning: dunused variable 'flag2' [-Wunused-variable]x .x) ,153 | g r o u pu(ignrto3u2p_)t, d a| t ^~~~~~~~~~~a 1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: 
variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | In file included from void /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp*:d1s: 
t,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :*154s:r10c:; warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]| ^ | = nullptr 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpretIn file included from e/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cppr:<1t: yIn file included from p/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.he:,13 : FIn file included from u/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hn:c169#: #/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.hd:e509v:r29e:d owarning: pfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]< type>, P507r | o t o L Lt,i df(utlildO)p,s >n(tchormema,d sa(lngtoh,r ewaodrsk)),; w\i d (| t ^i d%WARP_SI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hZ:E165):,33 :w anote: runinitialized use occurs herep (tid /165W | A R P _ ScIoZpEy)T,o S h| m ~~~~~~~~~~~~~~~~~~e m 8| ( stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)t id%W A508R | P _ S I ZwEa,r pdIsntB,l oscrkc(,t hbryetaedsI)d;x . 
x| / ^~~W ARP_SIZE/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h):,162 : 5| : ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]| warp(tid/WARP_SIZE 162 | 509 | d e f afullatg:T h r| e ^~~~~~~a d(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h(:t165i:d33%:4 )note: =uninitialized use occurs here= 3), 165g | r o u p (cgorpoyuTpo)S,h m e| m ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~8 ( t| i warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3d %WARP _510S | I Z E , sdtsetp,S iszrec(,n cbcyltSehsm)e;m . c| o ^~~m m.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Pr/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ho:t134o:,14 :0 >note: initialize the variable 'dst' to silence this warningp rim s134 | | ^ vo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cppi:d5 :*9d:s tnote: ,in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here *src ;5 | | ^ | = nullptr MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: 
variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 
'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, 
int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 
| copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::154217::1057:: warning: note: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 
217154 | | P r icmaistei v3e:s < T| , ^ RedOp, Fa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cppn:A5s:y9m:m enote: tin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested herer ic<1,1> ,5 | 1 , Pr o t o , 0M>S CpCrLi_mIsM P L| _ ^K ERN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cppE:L5_:E9N:T Rnote: Yin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here_ FUN C5_ | D E V R E D O P _MTSYCPCEL(_MIaMxP,L _iKnEtR3N2E_Lt_,E NfTaRlYs_eF)U;N C _| D ^E VREDOP/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h_:T402Y:P3E:( Mnote: aexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'x , int32 _402t | , f amlsscec)l;R u n| I ^n te/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hr:p402r:e3t:e rnote: t,e rPo(pca,l gPor,o twooLrLk1)2;8 ,\ f u| l ^l Ops>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h(:c165o:m33m:, note: auninitialized use occurs herel go, w o165r | k ) ; \c o p| y ^T oShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154: 105: | warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | M S C C Lc_aIsMeP L3_:K E R| N ^E L_ENTR/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cppY:_5F:U9N:C _note: Din instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested hereE VREDO P5_ | T Y P E ( M a x ,M SiCnCtL3_2I_MtP,L _fKaElRsNeE)L;_ E N| T ^R Y_FUN/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hC:_402D:E3V:R Enote: Dexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'O P_TYP E402( | M a xm,s cicnltR3u2n_Itn,t efraplrseet)e;r < t| y ^p e, /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hF:u405n:c3#:# dnote: eexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'v redop , mPsrcoctloRLuLn1I2n8t,e rfpurleltOeprs<>t(ycpoem,m ,F uanlcg#o#,d ewvorrekd)o;p <\t y p| e ^> , P/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hr:o165t:o33S:i mnote: puninitialized use occurs herel e),; f u| l ^~~l Ops>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h(:c162o:m5m:, warning: avariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]l go ,162 | w o r k )d;e 
f\a u l| t ^: | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h ^~~~~~~: 165:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h33::165 :note: 33uninitialized use occurs here: note: uninitialized use occurs here 165 | 165 | c o p y TcooSphymTeomS8h(mteimd8%(WtAiRdP%_WSAIRZPE_,S IdZsEt,, dssrtc,, sbryct,e sb)y;t e s| ) ^~~; | ^~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(ntIn file included from h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cppr:e1a: dIn 
file included from s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h):,13 : wIn file included from i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hd:(167t: i/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hd:%562W:A15R:P _warning: Sinitializer order does not match the declaration order [-Wreorder-ctor]I ZE), warp (562t | i d / W AtRiPd_(StIiZdE)),, n t| h ~~~~~~~~~~~~~~~~~~r e a| d stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)s (nth r508e | a d s ) ,w atripdIInnBBlloocckk((tthhrreeaaddIIddxx..x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' 
requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, 
int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ (tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 94%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 
'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | 
void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: 
note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ :33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | 
^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 
154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: 
expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hsrc,: 405b:y3t:e snote: )expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h: 162405: | 5 : mwarning: svariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]c cl R162u | n I n t edrepfraeutletr:< t y| p ^~~~~~~e , /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hF:u165n:c33#:# dnote: euninitialized use occurs herev red o165p | < t y p ec>o,p yPTrooSthomSeimm8p(ltei, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:: 134note: :field 'group' will be initialized after field 'stepSize'14 : note: initialize the variable 'dst' to silence this warning 562 | 134 | t ivdo(itdi d*)d,s tn,t h*rseracd;s ( n| t ^h r e| a = nullptrd s), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | 
tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: 
warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of 
function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: 
in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: 
note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded 
from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 93%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | 
case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' 
to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will 
be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = 
nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | 
void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | 
^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 
| mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169:
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset'
set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: 
unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused 
variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set 
but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 
271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 
| void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | 
case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid),
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: 
note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 
'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 
'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case 
is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 
154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr : note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' 
requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 
509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for host. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized 
after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of 
function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 
405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: 
note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group),In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in 
instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is 
used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 
mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, 
src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 
'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, 
FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, 
ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of 
member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized 
use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever 
switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 
mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' 
is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 
mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMax<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMax<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 95%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 
| mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, 
bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, 
false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ 
| = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, 
false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNELIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 
154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33:_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 
562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx906. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 99%] Building CXX object CMakeFiles/rccl.dir/git_version.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/git_version.cpp.o -MF CMakeFiles/rccl.dir/git_version.cpp.o.d -o CMakeFiles/rccl.dir/git_version.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/git_version.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG 
-D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int 
wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 
'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 
'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 
5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShme/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hm:8154(:t10i:d %warning: Wvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]A RP_SIZ E154, | d s t ,c assrec ,3 :b y t| e ^s ); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::9162:: 5note: :in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 5162 | | d e f aMuSlCtC:L _ I| M ^~~~~~~P L_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hK:E165R:N33E:L _note: Euninitialized use occurs hereN TRY_ F165U | N C _ D EcVoRpEyDTOoPS_hTmYePE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr m8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:t1541:610,: fwarning: avariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]l se) ;154 | | ^ ca/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs:e405 :33:: note: | expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp :4055 | : 9 :m snote: cin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested herec lRunI n5t | e r p r e t e r N,C _PDrEoVtRoESDiOmPp_lTeY, f405u | l l Ompssc>c(lcRoumnmI,n taelrgpor,e tweorr, 562P | r o t o Stiimdp(lteiu,p (fgurloluOpp)s,> ( c| o ^~~~~~~~~~~~~~~~~m m,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h :a562l:g60o:, note: wfield 'group' will be initialized after field 'stepSize'o rk); \562 | | ^ t/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hi:d165(:t33i:d )note: ,uninitialized use occurs here nthr e165a | d s ( n tchorpeyaTdosS)h,m 
etmi8d(ItniBdl%oWcAkR(Pt_hSrIeZaEd,I ddxs.tx,) ,s rgcr,o ubpy(tgerso)u;p ) ,| ^~~ | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ :165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | 
^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 
3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' 
requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, 
rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 
15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, 
rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not 
match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Max, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: 
variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, 
src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:In file included from 165/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp::331:: In file included from note: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.huninitialized use occurs here: 13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167 : 165/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h | : 562 : 15 :c owarning: pinitializer order does not match the declaration order [-Wreorder-ctor]y ToShmem8( t562i | d % W A RtPi_dS(ItZiEd,) ,d sntt,h rseracd,s (bnyttherse)a;d s )| , ^~~ tidInBlock/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h(:t162h:r5e:a dwarning: Ivariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]d x. 
x162) | , g r oduepf(agurlotu:p ) ,| ^~~~~~~ | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 165| : tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)33 : note: uninitialized use occurs here 563 | 165 | s tceoppSyiTzoeS(hnmcecml8S(htmiedm%.WcAoRmPm_.SbIuZfEf,S idzsets,[ NsCrCcL,_ PbRyOtTeOs_)S;I M P| L ^~~E ]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSC/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hC:L134_:I14M:P Lnote: _initialize the variable 'dst' to silence this warningK ERN E134L | _ E N T RvYo_iFdU N*Cd_sDtE,V R*EsDrOcP;_ T Y| P ^E ( M| i = nullptrn , int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 
15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 
15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/host_table.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 99%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o 
/usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/device_table.cpp gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include 
-I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | 
case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9In file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp :note: 1in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13 : 5In file included from | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h : 168 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h : 153 : 14M:S Cwarning: Cunused variable 'data1' [-Wunused-variable]L _IMPL_KERNE L153_ | E N T R Yu_iFnUtN3C2__t dataD1E,V RfElDaOgP1_,T YdPaEt(aM2i,n ,f liangt23;2 _ t| , ^~~~~ fals/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.he:)153;: 21 :| 
^warning: unused variable 'flag1' [-Wunused-variable] /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h: 399153: | 3 : note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'u int3 2399_ | t dmastcac1l,R ufnlIangt1e,r pdraettae2r,< tfylpaeg,2 ;F u n| c ^~~~~# #d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.he:v153r:e28d:o pwarning: 153, | P r o tuoiLnLt,3 2f_utl ldOaptsa>1(,c ofmlma,g 1a,l gdoa,t aw2o,r kf)l;a g\2 ; | ^| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h::165153::3335:: note: warning: uninitialized use occurs hereunused variable 'flag2' [-Wunused-variable] 153 | 165 | u icnotp3y2T_otS hdmaetma81(,t ifdl%aWgA1R,P _dSaItZaE2,, dfslta,g 2s;r c ,| ^~~~~b ytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 
'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 
| flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ .comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in 
instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 
154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, 
int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { 
| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux'
[ 96%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o
/usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1:
In file included from
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; 
| ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_In file included from S/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cppI:M1P: L/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hE:]154/:N10C:C Lwarning: _variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]S TEPS/s i154z | e o f ( Tc)a)s e{ 3 :| ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | ^| group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp::575:: 9note: :in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 217 | 5 | P r i m i t i v eMsSE,V R1E,D OPPr_oTtYoP,E (0M>i np,r iumisn t 8| _ ^t , fal/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpps:e5):;9 : | note: ^in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :5399 | : 3 : note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' MSC C399L | _ I MmPsLc_cKlERRuNnEILn_tEeNrTpRrYe_tFeUrN_,t ,P rfoatlosLeL),; f u| l ^l Ops>/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h(:c405o:m3m:, note: aexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'l go, wor k405) | ; \m s c| c ^l Run/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hI:n165t:e33r:p rnote: euninitialized use occurs heret erA,R PP_rSoItZoES,i mdpslte,< MsSrCcC,L _bCyHtUeNsK)S;T E P| S ^~~/ MSCCL/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h_:S162L:I5C:E Swarning: Tvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]E PS ,162 | M S C C Ld_eSfLaIuClEtS:T E P| ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr S>, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is 
used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 
| mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ nBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested 
here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch 
case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 
| tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint8_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 
| uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, 
uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member 
function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr ads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 
15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; 
| ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint32_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | 
uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | 
default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to 
silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch 
case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the 
variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, 
int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | 
^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple | , f u lclaOspes >3(:c o m| m ^, algo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp,: 5w:o9r:k )note: ;in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here \ | ^5 | /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 165 : 33 :M Snote: Cuninitialized use occurs hereC L_IMP L165_ | K E R N EcLo_pEyNTToRSYh_mFeUmN8C(_tDiEdV%RWEADROPP__STIYZPEE,( Mdisnt,, isnrtc6,4 _bty,t efsa)l;s e )| ; ^~~ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h162::4025::3 :warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized]note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 162 | 402 | d emfsacuclltR:u n I| n ^~~~~~~t er/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hp:r165e:t33e:r t,i dP%rWoAtRoPL_LS1I2Z8E,, fdusltl,O pssr>c(,c obmymt,e sa)l;g o ,| ^~~w ork); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h165: | 134 : 14 : cnote: oinitialize the variable 'dst' to silence this warningp yTo S134h | m e m 8 (vtoiidd% W*AdRsPt_,S I*ZsEr,c ;d s t| , ^ s r| c = nullptr, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp134: | 1 : In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :v13o: iIn file included from d/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h :*167d: s/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.ht:,562 :*15s:r cwarning: ;initializer order does not match the declaration order [-Wreorder-ctor] | ^ | = nullptr 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hIn file included from :/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h217::1357: :In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.hnote: :in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here169 : /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509 :21729 | : warning: Pfield 'group' will be initialized after field 'stepSize' [-Wreorder-ctor]r imitives <507T | , R e dtOipd,( tFiadn)A,s ynthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, 
ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ mmetric<1,1>, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in 
instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSize/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hs:[154N:C10C:L _warning: Pvariable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]R OTO _154L | L 1 2 8 ]c/aNsCeC L3_:S T E| P ^S /sizeo/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cppf:(5u:i9n:t 6note: 4in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here_ t)) {5 | | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group MSCCL_I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hM:P217L:_57K:E Rnote: Nin instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested hereE L_EN T217R | Y _ FPUrNiCm_iDtEiVvReEsD;, 1| , ^ Prot/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.ho:,402 :03>: pnote: rexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'i ms | ^402 | ms/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cppc:c5l:R9u:n Inote: nin instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested heret erpr e5t | e r < t y p e , MFSuCnCcL#_#IdMePvLr_eKdEoRpNT,R YP_rFoUtNoCL_LD1E2V8R,E DfOuPl_lTOYpPsE>((Mcionm,m ,i natl6g4o_,t ,w ofrakl)s;e )\; | | ^ ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h::165402::333:: note: note: uninitialized use occurs hereexpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 165402 | | m sccocplyRTuonSIhnmteemr8p(rteitde%rW;, P| r ^~~o toLL1/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h2:8162,: 5f:u lwarning: lvariable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized]O ps> (162c | o m m , daelfgaou,l tw:o r k| ) ^~~~~~~; \/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h : 165| : ^33 : note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple:,9 :f unote: lin instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested herel Ops>( c5o | m m , a l g o ,M SwCoCrLk_)I;M P\L _ K| E ^R NEL_/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.hE:N562T:R15Y:_ Fnote: Ufield 'nthreads' will be initialized after field 'tidInBlock'N C_DEV R562E | D O P _ TtYiPdE((tMiidn),, inntth6r4e_atd,s (fnatlhsree)a;d s )| , ^ tidI/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hn:B405l:o3c:k (note: texpanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'h readIdx .405x | ) , mgsrcoculpR(ugnrIonutpe)r,p r e| t ^~~~~~~~~~~~~~~~~e rd,) ,P rnotthorSeiamdpsl(en , fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' 
is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 
5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx906. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused 
variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, int64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o.d -o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file 
included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), 
warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, 
float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); 
\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is 
taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in 
instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the 
declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken 
[-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | 
tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | 
mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, 
false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 
405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | 
^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever 
switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' 
[-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, 
false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 
'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 
| copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken 
[-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, 
false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx941. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is 
taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order 
[-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_float.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, float, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 97%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 
| uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); 
| ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: 
note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 
1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: 
initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the 
variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | 
mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in 
instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template 
specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(commIn file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ , algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function 
template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | 
warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, 
algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, 
dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 
| copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1100. 
15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs 
here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' 
to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), 
tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, uint64_t, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 
514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, 
flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized 
whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), 
group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' 
is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is 
used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, 
ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: 
variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, 
FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, 
src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | 
default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' 
will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 
5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, 
FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 
'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used 
uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is 
used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: 
initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: 
variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: 
warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, 
ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE':154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 405 | m154s | c c l R ucnaIsnet e3r:p r e| t ^e r, ProtoSimple<2, 2>, false>' requested herep e>, Pro t5o | S i m p l e < M SMCSCCLC_LC_HIUMNPKLS_TKEEPRSN/EMLS_CECNLT_RSYL_IFCUENSCT_EDPESV,R EMDSOCPC_LT_YSPLEI(CMEiSnT,E PhSa>l,f ,f uflallOspes)>;( c o| m ^m , algo,/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h :w405o:r3k:) ;note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE'\ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h405: | 562 : 15m:s cnote: cfield 'nthreads' will be initialized after field 'tidInBlock'l RunI n562t | e r p r ettiedr(t,i dPIrnoBtlooScikm(ptlher(,t ifdu)l,l Onptsh>r(ecaodmsm(,n tahlrgeoa,d sw)o,r kt)i;d I\n B l| o ^c k(threa/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.hd:I165d:x33.:x )note: ,uninitialized use occurs here group( g165r | o u p ) ,c o p| y ^~~~~~~~~~~T oShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | 
mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | 
copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be 
initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx90a. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: 
variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 
| tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives<__half, FuncMin<__half>, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_half.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter<__half, FuncMin<__half>, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, half, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 
'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 
2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In 
file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, 
data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | 
^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = 
WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ 
^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 
153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized 
whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 
'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, 
ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 
'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 
165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void 
*dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 
'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included 
from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized]/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: 
uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | 
flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 
1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 
399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: 
warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use 
occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, 
*src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/W/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ ARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1101. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx90a. 
15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for host. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), 
wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_double.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, double, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [ 98%] Building CXX object CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o /usr/bin/clang++ -DCOMPILE_MSCCL_KERNEL -DENABLE_COLLTRACE -DENABLE_LL128 -DHIP_UNCACHED_MEMORY -DNVTX_NO_IMPL -DROCM_VERSION=60102 -DUSE_PROF_API=1 -DUSE_ROCM_SMI64CONFIG -D__HIP_PLATFORM_AMD__=1 -Drccl_EXPORTS -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/include -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives -I/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device -pipe -frecord-gcc-switches -Wall -g -O2 -std=c++14 -fPIC -parallel-jobs=12 -Wno-format-nonliteral -fgpu-rdc -fvisibility=hidden -mllvm --amdgpu-kernarg-preload-count=16 -mllvm=-amdgpu-early-inline-all=true -mllvm=-amdgpu-function-calls=false -x hip --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 -MD -MT CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o -MF CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o.d -o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o -c /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset 
= WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used 
[-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | 
int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = 
recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' 
[-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t 
data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used 
[-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: 
initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' 
will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, 
FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded 
from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: 
note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used 
uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | 
^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used 
uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | 
stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | 
MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, 
fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 
134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of 
function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x 
nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | 
~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | 
group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch 
default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this 
warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' 
will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | 
void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be 
initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 
134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, 
ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), 
nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for gfx1100. 15 warnings generated when compiling for gfx1102. 15 warnings generated when compiling for gfx940. 15 warnings generated when compiling for gfx900. 15 warnings generated when compiling for gfx906. 15 warnings generated when compiling for gfx1030. 15 warnings generated when compiling for gfx941. 15 warnings generated when compiling for gfx803. 15 warnings generated when compiling for gfx908. 15 warnings generated when compiling for gfx1101. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:168: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:14: warning: unused variable 'data1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:21: warning: unused variable 'flag1' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:28: warning: unused variable 'data2' [-Wunused-variable] 153 | uint32_t data1, flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:153:35: warning: unused variable 'flag2' [-Wunused-variable] 153 | uint32_t data1, 
flag1, data2, flag2; | ^~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll.h:514:9: warning: variable 'offset' set but not used [-Wunused-but-set-variable] 514 | int offset = tid; | ^ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:271:19: warning: unused variable 'ptr' [-Wunused-variable] 271 | uint64_t* ptr = recvPtr(0)+ll128Offset; | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:386:9: warning: variable 'wireOffset' set but not used [-Wunused-but-set-variable] 386 | int wireOffset = WireWordPerSlice*warp + 2*wid; | ^ 15 warnings generated when compiling for gfx90a. 
In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), 
nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 
'mscclRunInterpreter, ProtoLL, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:399:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 399 | mscclRunInterpreter, ProtoLL, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, 
work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:169: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_ll128.h:509:29: warning: field 'group' will be initialized after field 'stepSize' [-Wreorder-ctor] 507 | tid(tid), nthreads(nthreads), wid(tid%WARP_SIZE), warp(tid/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~ | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t) 508 | warpInBlock(threadIdx.x/WARP_SIZE), | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | warp(tid/WARP_SIZE 509 | flagThread((tid%4)==3), group(group), | ~~~~~~~~~~~~~~~~~~~~~ ^~~~~~~~~~~ | warpInBlock(threadIdx.x/WARP_SIZE flagThread((tid%4)==3 510 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_LL128]/NCCL_STEPS/sizeof(uint64_t)) { | 
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoLL128, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoLL128, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:402:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 402 | mscclRunInterpreter, ProtoLL128, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:154:10: warning: variable 'dst' is used uninitialized whenever switch case is taken [-Wsometimes-uninitialized] 154 | case 3: | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:162:5: warning: variable 'dst' is used uninitialized whenever switch default is taken [-Wsometimes-uninitialized] 162 | default: | ^~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:165:33: note: uninitialized use occurs here 165 | copyToShmem8(tid%WARP_SIZE, dst, src, bytes); | ^~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:134:14: note: initialize the variable 'dst' to silence this warning 134 | void *dst, *src; | ^ | = nullptr In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:1: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:13: In file included from /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/primitives.h:167: /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: warning: initializer order does not match the declaration order [-Wreorder-ctor] 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ ~~~~~~~~~~~~~~~~~~~~~~ ~~~~~~~~~~~ | tidInBlock(threadIdx.x nthreads(nthreads stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T) 563 | stepSize(ncclShmem.comm.buffSizes[NCCL_PROTO_SIMPLE]/NCCL_STEPS/sizeof(T)) { | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ | group(group /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:217:57: note: in instantiation of member function 'Primitives, FanAsymmetric<1, 1>, 1, ProtoSimple<2, 2>, 0>::Primitives' requested here 217 | Primitives, 1, Proto, 0> prims | ^ 
/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp:5:9: note: in instantiation of function template specialization 'mscclRunInterpreter, ProtoSimple<2, 2>, false>' requested here 5 | MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE(Min, rccl_bfloat16, false); | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/msccl_kernel_impl.h:405:3: note: expanded from macro 'MSCCL_IMPL_KERNEL_ENTRY_FUNC_DEVREDOP_TYPE' 405 | mscclRunInterpreter, ProtoSimple, fullOps>(comm, algo, work); \ | ^ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:15: note: field 'nthreads' will be initialized after field 'tidInBlock' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~~~~~~~ /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/hipify/src/collectives/device/prims_simple.h:562:60: note: field 'group' will be initialized after field 'stepSize' 562 | tid(tid), nthreads(nthreads), tidInBlock(threadIdx.x), group(group), | ^~~~~~~~~~~ 15 warnings generated when compiling for host. 15 warnings generated when compiling for gfx942. 
gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[2]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [100%] Linking CXX shared library librccl.so /usr/bin/cmake -E cmake_link_script CMakeFiles/rccl.dir/link.txt --verbose=1 /usr/bin/cmake -E time /usr/bin/clang++ -fPIC -pipe -frecord-gcc-switches -Wall -g -O2 -parallel-jobs=16 -Xoffload-linker -mllvm=-amdgpu-kernarg-preload-count=16 -shared -Wl,-soname,librccl.so.1 -o librccl.so.1.0 CMakeFiles/rccl.dir/hipify/src/bootstrap.cc.o CMakeFiles/rccl.dir/hipify/src/channel.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_gather.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_reduce.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_all.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/all_to_allv.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/broadcast.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/msccl.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/device/onerank_reduce.cu.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/gather.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/reduce.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/reduce_scatter.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/scatter.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/sendrecv.cc.o CMakeFiles/rccl.dir/hipify/src/debug.cc.o CMakeFiles/rccl.dir/hipify/src/enqueue.cc.o CMakeFiles/rccl.dir/hipify/src/graph/connect.cc.o CMakeFiles/rccl.dir/hipify/src/graph/paths.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rings.cc.o CMakeFiles/rccl.dir/hipify/src/graph/rome_models.cc.o CMakeFiles/rccl.dir/hipify/src/graph/search.cc.o CMakeFiles/rccl.dir/hipify/src/graph/topo.cc.o CMakeFiles/rccl.dir/hipify/src/graph/trees.cc.o CMakeFiles/rccl.dir/hipify/src/graph/tuning.cc.o CMakeFiles/rccl.dir/hipify/src/graph/xml.cc.o CMakeFiles/rccl.dir/hipify/src/group.cc.o CMakeFiles/rccl.dir/hipify/src/init.cc.o CMakeFiles/rccl.dir/hipify/src/misc/archinfo.cc.o CMakeFiles/rccl.dir/hipify/src/misc/argcheck.cc.o 
CMakeFiles/rccl.dir/hipify/src/misc/ibvsymbols.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ibvwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/ipcsocket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_lifecycle.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_parser.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_setup.cc.o CMakeFiles/rccl.dir/hipify/src/misc/msccl/msccl_status.cc.o CMakeFiles/rccl.dir/hipify/src/misc/npkit.cc.o CMakeFiles/rccl.dir/hipify/src/misc/nvmlwrap_stub.cc.o CMakeFiles/rccl.dir/hipify/src/misc/param.cc.o CMakeFiles/rccl.dir/hipify/src/misc/profiler.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocm_smi_wrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/rocmwrap.cc.o CMakeFiles/rccl.dir/hipify/src/misc/shmutils.cc.o CMakeFiles/rccl.dir/hipify/src/misc/signals.cc.o CMakeFiles/rccl.dir/hipify/src/misc/socket.cc.o CMakeFiles/rccl.dir/hipify/src/misc/strongstream.cc.o CMakeFiles/rccl.dir/hipify/src/misc/utils.cc.o CMakeFiles/rccl.dir/hipify/src/net.cc.o CMakeFiles/rccl.dir/hipify/src/proxy.cc.o CMakeFiles/rccl.dir/hipify/src/transport.cc.o CMakeFiles/rccl.dir/hipify/src/transport/coll_net.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_ib.cc.o CMakeFiles/rccl.dir/hipify/src/transport/net_socket.cc.o CMakeFiles/rccl.dir/hipify/src/transport/nvls.cc.o CMakeFiles/rccl.dir/hipify/src/transport/p2p.cc.o CMakeFiles/rccl.dir/hipify/src/transport/shm.cc.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllGather_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_Min.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_TREE_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_Min.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Prod.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_DIRECT_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_PreMulSum.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/AllReduce_COLLNET_CHAIN_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/AllToAllPivot_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Broadcast_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/Reduce_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Sum.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_LL128_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Prod.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Max.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_Min.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_PreMulSum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/ReduceScatter_RING_SIMPLE_SumPostDiv.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/SendRecv_RING_SIMPLE_Sum.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_int64_t.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Sum_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Prod_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_double.cpp.o 
CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Max_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint8_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint32_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_int64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_uint64_t.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_half.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_float.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_double.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/msccl_kernel_Min_rccl_bfloat16.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/device_table.cpp.o CMakeFiles/rccl.dir/hipify/src/collectives/device/host_table.cpp.o CMakeFiles/rccl.dir/git_version.cpp.o -fgpu-rdc -ldl /usr/lib64/librocm_smi64.so.1.0 /usr/lib64/libamdhip64.so.6.1.40093 --hip-link --offload-arch=gfx803 --offload-arch=gfx900:xnack- --offload-arch=gfx906:xnack- --offload-arch=gfx908:xnack- --offload-arch=gfx90a:xnack- --offload-arch=gfx90a:xnack+ --offload-arch=gfx940 --offload-arch=gfx941 --offload-arch=gfx942 --offload-arch=gfx1030 --offload-arch=gfx1100 --offload-arch=gfx1101 --offload-arch=gfx1102 /usr/lib/llvm-rocm/lib64/clang/17/lib/linux/libclang_rt.builtins-x86_64.a -lpthread -lrt -ldl Elapsed time (seconds): 439.843 /usr/bin/cmake -E cmake_symlink_library librccl.so.1.0 librccl.so.1 librccl.so gmake[2]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake[1]: Entering directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' [100%] Built target rccl gmake[1]: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' gmake: Entering directory 
'/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' /usr/bin/cmake -E cmake_progress_start /usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux/CMakeFiles 0 gmake: Leaving directory '/usr/src/RPM/BUILD/rccl-2.18.6/x86_64-alt-linux' + exit 0 Executing(%install): /bin/sh -e /usr/src/tmp/rpm-tmp.51477 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + /bin/chmod -Rf u+rwX -- /usr/src/tmp/rccl-buildroot + : + /bin/rm -rf -- /usr/src/tmp/rccl-buildroot + PATH=/usr/libexec/rpm-build:/usr/src/bin:/usr/bin:/bin:/usr/local/bin:/usr/games + cd rccl-2.18.6 + DESTDIR=/usr/src/tmp/rccl-buildroot + cmake --install x86_64-alt-linux --verbose -- Install configuration: "" -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so.1.0 -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so.1 -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so -- Installing: /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h -- Installing: /usr/src/tmp/rccl-buildroot/usr/include/rccl/nccl_net.h -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-0-1kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-1kb-4kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-1mb-4mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-4kb-8kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-4mb-16mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-8n-8kb-1mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allgather-allpairs-16n-16tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-16n-16tb.xml -- Installing: 
/usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-16n-32tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-1pass.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-32tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-ll-64tb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/allreduce-allpairs-8n-simple.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-0-1kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-1kb-4kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-1mb-4mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-4kb-8kb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-4mb-16mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-algorithms/alltoall-8n-8kb-1mb.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-ll128.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/rccl/msccl-unit-test-algorithms/all-reduce-ring-simple.xml -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-targets.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-targets-noconfig.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-config.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/lib64/cmake/rccl/rccl-config-version.cmake -- Installing: /usr/src/tmp/rccl-buildroot/usr/share/doc/rccl/LICENSE.txt + rm -rf 
/usr/src/tmp/rccl-buildroot/usr/rccl + rm -rf /usr/src/tmp/rccl-buildroot/usr/share/doc/rccl + /usr/lib/rpm/brp-alt Cleaning files in /usr/src/tmp/rccl-buildroot (auto) mode of './usr/lib64/librccl.so.1.0' changed from 0755 (rwxr-xr-x) to 0644 (rw-r--r--) Verifying and fixing files in /usr/src/tmp/rccl-buildroot (binconfig,pkgconfig,libtool,desktop,gnuconfig) Checking contents of files in /usr/src/tmp/rccl-buildroot/ (default) Compressing files in /usr/src/tmp/rccl-buildroot (auto) Adjusting library links in /usr/src/tmp/rccl-buildroot ./usr/lib64: (from :0) librccl.so.1 -> librccl.so.1.0 Verifying ELF objects in /usr/src/tmp/rccl-buildroot (arch=normal,fhs=normal,lfs=relaxed,lint=relaxed,rpath=normal,stack=normal,textrel=normal,unresolved=normal) section [ 3] '.dynsym': symbol 338 (__hip_fatbin): symbol in dynamic symbol table with non-default visibility verify-elf: WARNING: ./usr/lib64/librccl.so.1.0: eu-elflint failed Splitting links to aliased files under /{,s}bin in /usr/src/tmp/rccl-buildroot Processing files: librccl1-2.18.6-alt0.1 Executing(%doc): /bin/sh -e /usr/src/tmp/rpm-tmp.36313 + umask 022 + /bin/mkdir -p /usr/src/RPM/BUILD + cd /usr/src/RPM/BUILD + cd rccl-2.18.6 + DOCDIR=/usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + export DOCDIR + rm -rf /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + /bin/mkdir -p /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + cp -prL README.md LICENSE.txt NOTICES.txt CHANGELOG.md /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + chmod -R go-w /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + chmod -R a+rX /usr/src/tmp/rccl-buildroot/usr/share/doc/librccl1-2.18.6 + exit 0 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.XCFMeH find-provides: running scripts (alternatives,debuginfo,lib,pam,perl,pkgconfig,python,python3,shell) lib.prov: /usr/src/tmp/rccl-buildroot/usr/lib64/librccl.so.1: 192 symbols, 18 bpp Finding Requires 
(using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.1vvu9E find-requires: running scripts (cpp,debuginfo,files,lib,pam,perl,pkgconfig,pkgconfiglib,python,python3,rpmlib,shebang,shell,static,symlinks,systemd-services) warning: librccl1 provides another subpackage: rccl Provides: rccl = 2.18.6-alt0.1, librccl.so.1()(64bit) = set:ldySY8WxOALBnhFpKYr8hTuOp4f4mGu2jLdMJjcZCXM47UXuwyyGRGWXKgETcgdjMi5wuDQ3qOxtZBm81J7pYPMIUZa5VdctQkKefUrjndPqhuFfak8KACxDBZ2WZJDfvJzZ89VmVuIkNiinUuRvWX09AlpiViW0mDiqb8i3YJossrximfgU5FDIg3bfAM3p87RAKcG4MZinBzsSGNgsBCROo9k0v79172vNT21EO938Mcw8TzCb018bhHvvzgmTvhhNQWFQoI4SSRedfYZyMcS4HABqmacW4xzCUZaO5x9LSUxVFl0qy5C7FFGgAn04Hyxww4hPwz6LsL4UDEnEe2dpGZx29zB56rIHYGcZG1BqjQafIX1WE3sbDhXCpfBjMq4 Requires: ld-linux-x86-64.so.2()(64bit) >= set:jiids, ld-linux-x86-64.so.2(GLIBC_2.3)(64bit), libamdhip64.so.6()(64bit) >= set:mgEl4iHah5shPP2z5A5zYttYI7XpZyRnhe1J6ZgwULwPlWeYZ4XbZd2bItRMqeW4hZmmUYmDZdpDnrYqkUKOuzfUwKzIyQItN97gggSsa6v6KYBa3m70aJ49gh1ckMQcuEPMZKgWZw, libamdhip64.so.6(hip_4.2)(64bit), libamdhip64.so.6(hip_4.3)(64bit), libamdhip64.so.6(hip_4.5)(64bit), libamdhip64.so.6(hip_5.0)(64bit), libamdhip64.so.6(hip_5.3)(64bit), libamdhip64.so.6(hip_6.0)(64bit), libc.so.6(GLIBC_2.14)(64bit), libc.so.6(GLIBC_2.17)(64bit), libc.so.6(GLIBC_2.2.5)(64bit), libc.so.6(GLIBC_2.3)(64bit), libc.so.6(GLIBC_2.3.2)(64bit), libc.so.6(GLIBC_2.3.4)(64bit), libc.so.6(GLIBC_2.33)(64bit), libc.so.6(GLIBC_2.34)(64bit), libc.so.6(GLIBC_2.38)(64bit), libc.so.6(GLIBC_2.6)(64bit), libgcc_s.so.1(GCC_3.0)(64bit), libm.so.6(GLIBC_2.2.5)(64bit), librocm_smi64.so.1()(64bit) >= set:miSwa9ZECgdMsH9hGiyEU5mNQ1, libstdc++.so.6(CXXABI_1.3)(64bit), libstdc++.so.6(CXXABI_1.3.5)(64bit), libstdc++.so.6(CXXABI_1.3.7)(64bit), libstdc++.so.6(GLIBCXX_3.4)(64bit), libstdc++.so.6(GLIBCXX_3.4.11)(64bit), libstdc++.so.6(GLIBCXX_3.4.18)(64bit), libstdc++.so.6(GLIBCXX_3.4.19)(64bit), libstdc++.so.6(GLIBCXX_3.4.21)(64bit), libstdc++.so.6(GLIBCXX_3.4.22)(64bit), 
libstdc++.so.6(GLIBCXX_3.4.29)(64bit), rtld(GNU_HASH) Requires(rpmlib): rpmlib(SetVersions) Finding debuginfo files (using /usr/lib/rpm/find-debuginfo-files) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.BU0YDw Creating librccl1-debuginfo package Processing files: librccl-devel-2.18.6-alt0.1 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.5UZwun find-provides: running scripts (alternatives,debuginfo,lib,pam,perl,pkgconfig,python,python3,shell) Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.t40WFV find-requires: running scripts (cpp,debuginfo,files,lib,pam,perl,pkgconfig,pkgconfiglib,python,python3,rpmlib,shebang,shell,static,symlinks,systemd-services) In file included from /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h:12: /usr/include/hip/hip_runtime.h:66:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 66 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/include/hip/hip_runtime.h:70: /usr/include/hip/hip_runtime_api.h:8852:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 8852 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/include/hip/hip_runtime.h:71: /usr/include/hip/library_types.h:75:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 75 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from /usr/include/hip/hip_runtime.h:73: /usr/include/hip/hip_vector_types.h:38:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 38 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ In file included from 
/usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h:13: /usr/include/hip/hip_fp16.h:33:2: error: #error ("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); 33 | #error("Must define exactly one of __HIP_PLATFORM_AMD__ or __HIP_PLATFORM_NVIDIA__"); | ^~~~~ cpp.req: /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h: cpp failed, trying c++ mode x86_64-alt-linux-cpp: fatal error: cannot execute 'cc1plus': execvp: No such file or directory compilation terminated. cpp.req: WARNING: /usr/src/tmp/rccl-buildroot/usr/include/rccl/rccl.h: cpp failed /usr/src/tmp/rccl-buildroot/usr/include/rccl/nccl_net.h:10:10: fatal error: nccl.h: No such file or directory 10 | #include "nccl.h" | ^~~~~~~~ compilation terminated. cpp.req: WARNING: /usr/src/tmp/rccl-buildroot/usr/include/rccl/nccl_net.h: cpp failed Provides: rccl-devel = 2.18.6-alt0.1 Requires: /usr/lib64/librccl.so.1 Finding debuginfo files (using /usr/lib/rpm/find-debuginfo-files) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.6UcuEh Processing files: librccl1-debuginfo-2.18.6-alt0.1 Finding Provides (using /usr/lib/rpm/find-provides) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.sVaGxD find-provides: running scripts (debuginfo) Finding Requires (using /usr/lib/rpm/find-requires) Executing: /bin/sh -e /usr/src/tmp/rpm-tmp.IEeW80 find-requires: running scripts (debuginfo) Provides: debug64(librccl.so.1) Requires: librccl1 = 2.18.6-alt0.1, debug64(ld-linux-x86-64.so.2), debug64(libamdhip64.so.6), debug64(libc.so.6), debug64(libgcc_s.so.1), debug64(libm.so.6), debug64(librocm_smi64.so.1), debug64(libstdc++.so.6) Adding to librccl1-debuginfo a strict dependency on librccl1 Adding to librccl-devel a strict dependency on librccl1 Removing 1 extra deps from librccl-devel due to dependency on librccl1 Wrote: /usr/src/RPM/RPMS/x86_64/librccl1-2.18.6-alt0.1.x86_64.rpm (w2T16.xzdio) Wrote: /usr/src/RPM/RPMS/x86_64/librccl-devel-2.18.6-alt0.1.x86_64.rpm (w2T16.xzdio) Wrote: 
/usr/src/RPM/RPMS/x86_64/librccl1-debuginfo-2.18.6-alt0.1.x86_64.rpm (w2.lzdio) 19181.50user 783.12system 23:42.47elapsed 1403%CPU (0avgtext+0avgdata 5530940maxresident)k 4608inputs+0outputs (91major+87103950minor)pagefaults 0swaps /.out/librccl1-2.18.6-alt0.1.x86_64.rpm: bad symbols in the license tag: // /.out/librccl-devel-2.18.6-alt0.1.x86_64.rpm: bad symbols in the license tag: // /.out/librccl1-debuginfo-2.18.6-alt0.1.x86_64.rpm: bad symbols in the license tag: // 8.08user 5.77system 25:18.93elapsed 0%CPU (0avgtext+0avgdata 135932maxresident)k 1768856inputs+0outputs (0major+333561minor)pagefaults 0swaps